Overview

Dataset statistics

Number of variables: 70
Number of observations: 15000
Missing cells: 3703
Missing cells (%): 0.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.0 MiB
Average record size in memory: 560.0 B
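
As a quick sanity check on the figures above, the missing-cell percentage can be recomputed from the variable and observation counts. This is a minimal sketch; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
n_variables = 70
n_observations = 15000
missing_cells = 3703

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Rounded to one decimal place, the result agrees with the reported 0.4%.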

Variable types

Numeric: 7
Categorical: 63

Alerts

country has a high cardinality: 137 distinct values [High cardinality]
Q1 is highly correlated with Q4 [High correlation]
Q4 is highly correlated with Q1 [High correlation]
Q7 is highly correlated with Q12 [High correlation]
Q11 is highly correlated with Q19 [High correlation]
Q12 is highly correlated with Q7 [High correlation]
Q19 is highly correlated with Q11 [High correlation]
testelapse is highly correlated with surveyelapse [High correlation]
surveyelapse is highly correlated with testelapse [High correlation]
TIPI1 is highly correlated with TIPI6 [High correlation]
TIPI4 is highly correlated with TIPI9 [High correlation]
TIPI6 is highly correlated with TIPI1 [High correlation]
TIPI9 is highly correlated with TIPI4 [High correlation]
VCL1 is highly correlated with VCL4 and 3 other fields [High correlation]
VCL4 is highly correlated with VCL1 and 2 other fields [High correlation]
VCL10 is highly correlated with VCL1 and 1 other field [High correlation]
VCL15 is highly correlated with VCL1 and 3 other fields [High correlation]
VCL16 is highly correlated with VCL1 and 2 other fields [High correlation]
education is highly correlated with age [High correlation]
age is highly correlated with education and 1 other field [High correlation]
married is highly correlated with age [High correlation]
age is highly correlated with education [High correlation]
VCL10 is highly correlated with VCL1 and 1 other field [High correlation]
VCL16 is highly correlated with VCL4 and 2 other fields [High correlation]
VCL4 is highly correlated with VCL16 and 2 other fields [High correlation]
VCL1 is highly correlated with VCL10 and 3 other fields [High correlation]
VCL15 is highly correlated with VCL10 and 3 other fields [High correlation]
Q1 is highly correlated with Q3 and 1 other field [High correlation]
Q2 is highly correlated with Q12 and 1 other field [High correlation]
Q3 is highly correlated with Q1 and 4 other fields [High correlation]
Q4 is highly correlated with Q1 [High correlation]
Q5 is highly correlated with Q9 and 2 other fields [High correlation]
Q6 is highly correlated with Q9 and 3 other fields [High correlation]
Q7 is highly correlated with Q12 and 1 other field [High correlation]
Q8 is highly correlated with Q13 and 2 other fields [High correlation]
Q9 is highly correlated with Q5 and 4 other fields [High correlation]
Q11 is highly correlated with Q19 [High correlation]
Q12 is highly correlated with Q2 and 2 other fields [High correlation]
Q13 is highly correlated with Q6 and 6 other fields [High correlation]
Q14 is highly correlated with Q20 [High correlation]
Q15 is highly correlated with Q2 [High correlation]
Q16 is highly correlated with Q3 and 2 other fields [High correlation]
Q17 is highly correlated with Q3 and 5 other fields [High correlation]
Q19 is highly correlated with Q11 [High correlation]
Q20 is highly correlated with Q13 and 1 other field [High correlation]
Q22 is highly correlated with Q13 and 1 other field [High correlation]
Q23 is highly correlated with Q6 and 3 other fields [High correlation]
Q24 is highly correlated with Q5 and 2 other fields [High correlation]
Q25 is highly correlated with Q7 and 1 other field [High correlation]
Q26 is highly correlated with Q5 and 3 other fields [High correlation]
introelapse is highly correlated with testelapse [High correlation]
testelapse is highly correlated with introelapse [High correlation]
TIPI1 is highly correlated with Q3 and 3 other fields [High correlation]
TIPI3 is highly correlated with TIPI8 [High correlation]
TIPI4 is highly correlated with TIPI9 [High correlation]
TIPI6 is highly correlated with Q3 and 2 other fields [High correlation]
TIPI8 is highly correlated with TIPI3 [High correlation]
TIPI9 is highly correlated with TIPI4 [High correlation]
VCL1 is highly correlated with VCL2 and 5 other fields [High correlation]
VCL2 is highly correlated with VCL1 and 6 other fields [High correlation]
VCL3 is highly correlated with VCL7 and 2 other fields [High correlation]
VCL4 is highly correlated with VCL1 and 6 other fields [High correlation]
VCL5 is highly correlated with VCL1 and 6 other fields [High correlation]
VCL7 is highly correlated with VCL3 and 2 other fields [High correlation]
VCL8 is highly correlated with VCL7 and 1 other field [High correlation]
VCL10 is highly correlated with VCL1 and 5 other fields [High correlation]
VCL11 is highly correlated with VCL3 and 2 other fields [High correlation]
VCL13 is highly correlated with VCL2 and 2 other fields [High correlation]
VCL14 is highly correlated with VCL2 and 4 other fields [High correlation]
VCL15 is highly correlated with VCL1 and 6 other fields [High correlation]
VCL16 is highly correlated with VCL1 and 4 other fields [High correlation]
education is highly correlated with voted [High correlation]
voted is highly correlated with education [High correlation]
country has 190 (1.3%) missing values [Missing]
education has 167 (1.1%) missing values [Missing]
religion has 245 (1.6%) missing values [Missing]
orientation has 399 (2.7%) missing values [Missing]
familysize has 319 (2.1%) missing values [Missing]
introelapse is highly skewed (γ1 = 48.40398766) [Skewed]
testelapse is highly skewed (γ1 = 44.46473054) [Skewed]
surveyelapse is highly skewed (γ1 = 82.28836108) [Skewed]
age is highly skewed (γ1 = 122.1567763) [Skewed]
familysize is highly skewed (γ1 = 120.5748218) [Skewed]
df_index is uniformly distributed [Uniform]
df_index has unique values [Unique]
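
The skewness alerts above report very large γ1 values for age and familysize. The sketch below shows how a single extreme entry can produce skewness of that order. It assumes γ1 is the adjusted Fisher-Pearson estimator (the one pandas' `Series.skew()` computes); the report itself does not state the formula, and the sample data is hypothetical rather than taken from the dataset.

```python
import math


def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness coefficient (assumed to match the report's γ1)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    g1 = m3 / m2 ** 1.5
    return g1 * math.sqrt(n * (n - 1)) / (n - 2)


# Hypothetical data: 999 plausible ages plus one data-entry error.
ages = [30] * 999 + [10000]
skew = sample_skewness(ages)
print(f"skewness: {skew:.2f}")
```

With one outlier among n otherwise identical values, this estimator comes out at roughly the square root of n, so skewness above 100 in a 15000-row column is consistent with a handful of implausible entries rather than a genuinely long-tailed distribution.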

Reproduction

Analysis started: 2022-08-02 17:36:16.552782
Analysis finished: 2022-08-02 17:37:34.891183
Duration: 1 minute and 18.34 seconds
Software version: pandas-profiling v3.2.0
Download configuration: config.json
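
The reported duration follows directly from the two timestamps. A minimal sketch of that arithmetic using only the standard library (the format string is an assumption about how the timestamps are written):

```python
from datetime import datetime

# Timestamp layout assumed from the two values shown above.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2022-08-02 17:36:16.552782", FMT)
finished = datetime.strptime("2022-08-02 17:37:34.891183", FMT)

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minute and {seconds:.2f} seconds")
```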

Variables

df_index
Real number (ℝ≥0)

UNIFORM
UNIQUE

Distinct: 15000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7499.5
Minimum: 0
Maximum: 14999
Zeros: 1
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 117.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 749.95
Q1: 3749.75
median: 7499.5
Q3: 11249.25
95-th percentile: 14249.05
Maximum: 14999
Range: 14999
Interquartile range (IQR): 7499.5

Descriptive statistics

Standard deviation: 4330.271354
Coefficient of variation (CV): 0.5774080077
Kurtosis: -1.2
Mean: 7499.5
Median Absolute Deviation (MAD): 3750
Skewness: 0
Sum: 112492500
Variance: 18751250
Monotonicity: Strictly increasing
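
Because df_index is unique, strictly increasing and spans 0 to 14999, its statistics can be reproduced from a plain integer range. The sketch below assumes df_index is exactly the row positions 0..14999 and that the report uses the sample (n - 1) variance and linearly interpolated quantiles; both assumptions are consistent with the figures above but are not stated in the report.

```python
import statistics

# Assumption: df_index is simply the row positions 0..14999.
values = list(range(15000))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
# method="inclusive" interpolates linearly between the minimum and maximum.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

print("mean:", mean, "sum:", sum(values))
print("variance:", variance, "std:", round(std, 6))
print("Q1:", q1, "median:", median, "Q3:", q3, "IQR:", q3 - q1)
print("CV:", round(std / mean, 10))
```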
Histogram with fixed size bins (bins=50)
Value                 Count  Frequency (%)
0                     1      < 0.1%
10004                 1      < 0.1%
9992                  1      < 0.1%
9993                  1      < 0.1%
9994                  1      < 0.1%
9995                  1      < 0.1%
9996                  1      < 0.1%
9997                  1      < 0.1%
9998                  1      < 0.1%
9999                  1      < 0.1%
Other values (14990)  14990  99.9%

Value  Count  Frequency (%)
0      1      < 0.1%
1      1      < 0.1%
2      1      < 0.1%
3      1      < 0.1%
4      1      < 0.1%
5      1      < 0.1%
6      1      < 0.1%
7      1      < 0.1%
8      1      < 0.1%
9      1      < 0.1%

Value  Count  Frequency (%)
14999  1      < 0.1%
14998  1      < 0.1%
14997  1      < 0.1%
14996  1      < 0.1%
14995  1      < 0.1%
14994  1      < 0.1%
14993  1      < 0.1%
14992  1      < 0.1%
14991  1      < 0.1%
14990  1      < 0.1%

Q1
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 41
Missing (%): 0.3%
Memory size: 117.3 KiB

Value  Count
5.0    5939
4.0    4924
3.0    2444
2.0    1055
1.0    597

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44877
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1.0
2nd row: 4.0
3rd row: 4.0
4th row: 4.0
5th row: 4.0

Common Values

Value      Count  Frequency (%)
5.0        5939   39.6%
4.0        4924   32.8%
3.0        2444   16.3%
2.0        1055   7.0%
1.0        597    4.0%
(Missing)  41     0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    5939   39.7%
4.0    4924   32.9%
3.0    2444   16.3%
2.0    1055   7.1%
1.0    597    4.0%
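
The Common Values and Category Frequency Plot tables for Q1 list the same counts with slightly different percentages (39.6% versus 39.7% for the value 5.0, for example). The figures are consistent with the first table dividing by all rows and the second dividing by answered rows only. That reading is inferred from the numbers, not stated in the report; a minimal sketch:

```python
# Counts copied from the Q1 tables above.
counts = {"5.0": 5939, "4.0": 4924, "3.0": 2444, "2.0": 1055, "1.0": 597}
missing = 41

answered = sum(counts.values())  # rows with a response
total = answered + missing       # all rows, including missing

for value, count in counts.items():
    share_of_all = 100 * count / total          # assumed basis of Common Values
    share_of_answered = 100 * count / answered  # assumed basis of Category Frequency Plot
    print(f"{value}: {share_of_all:.1f}% of all rows, "
          f"{share_of_answered:.1f}% of answered rows")
```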

Most occurring characters

Value  Count  Frequency (%)
.      14959  33.3%
0      14959  33.3%
5      5939   13.2%
4      4924   11.0%
3      2444   5.4%
2      1055   2.4%
1      597    1.3%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29918  66.7%
Other Punctuation  14959  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14959  50.0%
5      5939   19.9%
4      4924   16.5%
3      2444   8.2%
2      1055   3.5%
1      597    2.0%
Other Punctuation
Value  Count  Frequency (%)
.      14959  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44877  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14959  33.3%
0      14959  33.3%
5      5939   13.2%
4      4924   11.0%
3      2444   5.4%
2      1055   2.4%
1      597    1.3%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44877  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14959  33.3%
0      14959  33.3%
5      5939   13.2%
4      4924   11.0%
3      2444   5.4%
2      1055   2.4%
1      597    1.3%
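
The character-level tables for Q1 follow mechanically from the value counts, because every response is stored as a three-character string such as "5.0". The sketch below rebuilds the character and Unicode-category totals with the standard library; it illustrates the arithmetic and is not the profiler's own implementation.

```python
from collections import Counter
import unicodedata

# Counts copied from the Q1 tables above.
counts = {"5.0": 5939, "4.0": 4924, "3.0": 2444, "2.0": 1055, "1.0": 597}

# Each occurrence of a value contributes one occurrence of each of its characters.
char_counts = Counter()
for value, count in counts.items():
    for ch in value:
        char_counts[ch] += count

# Group characters by Unicode general category:
# "Nd" is Decimal Number, "Po" is Other Punctuation.
category_counts = Counter()
for ch, count in char_counts.items():
    category_counts[unicodedata.category(ch)] += count

total_chars = sum(char_counts.values())
print("total characters:", total_chars)
print("per character:", char_counts.most_common())
print("per category:", dict(category_counts))
```

The same reasoning explains why "." and "0" each account for a third of all characters in every Likert-style column of this report.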

Q2
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 69
Missing (%): 0.5%
Memory size: 117.3 KiB

Value  Count
5.0    7236
4.0    4129
3.0    1688
2.0    1067
1.0    811

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44793
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.0
2nd row: 4.0
3rd row: 5.0
4th row: 4.0
5th row: 4.0

Common Values

Value      Count  Frequency (%)
5.0        7236   48.2%
4.0        4129   27.5%
3.0        1688   11.3%
2.0        1067   7.1%
1.0        811    5.4%
(Missing)  69     0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    7236   48.5%
4.0    4129   27.7%
3.0    1688   11.3%
2.0    1067   7.1%
1.0    811    5.4%

Most occurring characters

Value  Count  Frequency (%)
.      14931  33.3%
0      14931  33.3%
5      7236   16.2%
4      4129   9.2%
3      1688   3.8%
2      1067   2.4%
1      811    1.8%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29862  66.7%
Other Punctuation  14931  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14931  50.0%
5      7236   24.2%
4      4129   13.8%
3      1688   5.7%
2      1067   3.6%
1      811    2.7%
Other Punctuation
Value  Count  Frequency (%)
.      14931  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44793  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14931  33.3%
0      14931  33.3%
5      7236   16.2%
4      4129   9.2%
3      1688   3.8%
2      1067   2.4%
1      811    1.8%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44793  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14931  33.3%
0      14931  33.3%
5      7236   16.2%
4      4129   9.2%
3      1688   3.8%
2      1067   2.4%
1      811    1.8%

Q3
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 50
Missing (%): 0.3%
Memory size: 117.3 KiB

Value  Count
5.0    7626
4.0    4788
3.0    1256
2.0    801
1.0    479

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44850
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.0
2nd row: 4.0
3rd row: 5.0
4th row: 4.0
5th row: 4.0

Common Values

Value      Count  Frequency (%)
5.0        7626   50.8%
4.0        4788   31.9%
3.0        1256   8.4%
2.0        801    5.3%
1.0        479    3.2%
(Missing)  50     0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    7626   51.0%
4.0    4788   32.0%
3.0    1256   8.4%
2.0    801    5.4%
1.0    479    3.2%

Most occurring characters

Value  Count  Frequency (%)
.      14950  33.3%
0      14950  33.3%
5      7626   17.0%
4      4788   10.7%
3      1256   2.8%
2      801    1.8%
1      479    1.1%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29900  66.7%
Other Punctuation  14950  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14950  50.0%
5      7626   25.5%
4      4788   16.0%
3      1256   4.2%
2      801    2.7%
1      479    1.6%
Other Punctuation
Value  Count  Frequency (%)
.      14950  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44850  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14950  33.3%
0      14950  33.3%
5      7626   17.0%
4      4788   10.7%
3      1256   2.8%
2      801    1.8%
1      479    1.1%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44850  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14950  33.3%
0      14950  33.3%
5      7626   17.0%
4      4788   10.7%
3      1256   2.8%
2      801    1.8%
1      479    1.1%

Q4
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 71
Missing (%): 0.5%
Memory size: 117.3 KiB

Value  Count
5.0    5512
4.0    4313
3.0    2564
2.0    1619
1.0    921

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44787
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.0
2nd row: 4.0
3rd row: 4.0
4th row: 2.0
5th row: 4.0

Common Values

Value      Count  Frequency (%)
5.0        5512   36.7%
4.0        4313   28.8%
3.0        2564   17.1%
2.0        1619   10.8%
1.0        921    6.1%
(Missing)  71     0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    5512   36.9%
4.0    4313   28.9%
3.0    2564   17.2%
2.0    1619   10.8%
1.0    921    6.2%

Most occurring characters

Value  Count  Frequency (%)
.      14929  33.3%
0      14929  33.3%
5      5512   12.3%
4      4313   9.6%
3      2564   5.7%
2      1619   3.6%
1      921    2.1%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29858  66.7%
Other Punctuation  14929  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14929  50.0%
5      5512   18.5%
4      4313   14.4%
3      2564   8.6%
2      1619   5.4%
1      921    3.1%
Other Punctuation
Value  Count  Frequency (%)
.      14929  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44787  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14929  33.3%
0      14929  33.3%
5      5512   12.3%
4      4313   9.6%
3      2564   5.7%
2      1619   3.6%
1      921    2.1%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44787  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14929  33.3%
0      14929  33.3%
5      5512   12.3%
4      4313   9.6%
3      2564   5.7%
2      1619   3.6%
1      921    2.1%

Q5
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 38
Missing (%): 0.3%
Memory size: 117.3 KiB

Value  Count
5.0    5619
4.0    4833
3.0    2153
2.0    1458
1.0    899

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44886
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1.0
2nd row: 4.0
3rd row: 3.0
4th row: 4.0
5th row: 3.0

Common Values

Value      Count  Frequency (%)
5.0        5619   37.5%
4.0        4833   32.2%
3.0        2153   14.4%
2.0        1458   9.7%
1.0        899    6.0%
(Missing)  38     0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    5619   37.6%
4.0    4833   32.3%
3.0    2153   14.4%
2.0    1458   9.7%
1.0    899    6.0%

Most occurring characters

Value  Count  Frequency (%)
.      14962  33.3%
0      14962  33.3%
5      5619   12.5%
4      4833   10.8%
3      2153   4.8%
2      1458   3.2%
1      899    2.0%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29924  66.7%
Other Punctuation  14962  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14962  50.0%
5      5619   18.8%
4      4833   16.2%
3      2153   7.2%
2      1458   4.9%
1      899    3.0%
Other Punctuation
Value  Count  Frequency (%)
.      14962  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44886  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14962  33.3%
0      14962  33.3%
5      5619   12.5%
4      4833   10.8%
3      2153   4.8%
2      1458   3.2%
1      899    2.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44886  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14962  33.3%
0      14962  33.3%
5      5619   12.5%
4      4833   10.8%
3      2153   4.8%
2      1458   3.2%
1      899    2.0%

Q6
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 48
Missing (%): 0.3%
Memory size: 117.3 KiB

Value  Count
4.0    4846
5.0    4336
3.0    3238
2.0    1659
1.0    873

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44856
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 4.0
2nd row: 5.0
3rd row: 5.0
4th row: 3.0
5th row: 3.0

Common Values

Value      Count  Frequency (%)
4.0        4846   32.3%
5.0        4336   28.9%
3.0        3238   21.6%
2.0        1659   11.1%
1.0        873    5.8%
(Missing)  48     0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
4.0    4846   32.4%
5.0    4336   29.0%
3.0    3238   21.7%
2.0    1659   11.1%
1.0    873    5.8%

Most occurring characters

Value  Count  Frequency (%)
.      14952  33.3%
0      14952  33.3%
4      4846   10.8%
5      4336   9.7%
3      3238   7.2%
2      1659   3.7%
1      873    1.9%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29904  66.7%
Other Punctuation  14952  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14952  50.0%
4      4846   16.2%
5      4336   14.5%
3      3238   10.8%
2      1659   5.5%
1      873    2.9%
Other Punctuation
Value  Count  Frequency (%)
.      14952  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44856  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14952  33.3%
0      14952  33.3%
4      4846   10.8%
5      4336   9.7%
3      3238   7.2%
2      1659   3.7%
1      873    1.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44856  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14952  33.3%
0      14952  33.3%
4      4846   10.8%
5      4336   9.7%
3      3238   7.2%
2      1659   3.7%
1      873    1.9%

Q7
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 76
Missing (%): 0.5%
Memory size: 117.3 KiB

Value  Count
5.0    7584
4.0    4385
3.0    1581
2.0    854
1.0    520

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44772
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.0
2nd row: 4.0
3rd row: 5.0
4th row: 3.0
5th row: 4.0

Common Values

Value      Count  Frequency (%)
5.0        7584   50.6%
4.0        4385   29.2%
3.0        1581   10.5%
2.0        854    5.7%
1.0        520    3.5%
(Missing)  76     0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    7584   50.8%
4.0    4385   29.4%
3.0    1581   10.6%
2.0    854    5.7%
1.0    520    3.5%

Most occurring characters

Value  Count  Frequency (%)
.      14924  33.3%
0      14924  33.3%
5      7584   16.9%
4      4385   9.8%
3      1581   3.5%
2      854    1.9%
1      520    1.2%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29848  66.7%
Other Punctuation  14924  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14924  50.0%
5      7584   25.4%
4      4385   14.7%
3      1581   5.3%
2      854    2.9%
1      520    1.7%
Other Punctuation
Value  Count  Frequency (%)
.      14924  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44772  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14924  33.3%
0      14924  33.3%
5      7584   16.9%
4      4385   9.8%
3      1581   3.5%
2      854    1.9%
1      520    1.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44772  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14924  33.3%
0      14924  33.3%
5      7584   16.9%
4      4385   9.8%
3      1581   3.5%
2      854    1.9%
1      520    1.2%

Q8
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 48
Missing (%): 0.3%
Memory size: 117.3 KiB

Value  Count
5.0    6605
4.0    3888
3.0    1823
1.0    1319
2.0    1317

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44856
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.0
2nd row: 4.0
3rd row: 5.0
4th row: 5.0
5th row: 2.0

Common Values

Value      Count  Frequency (%)
5.0        6605   44.0%
4.0        3888   25.9%
3.0        1823   12.2%
1.0        1319   8.8%
2.0        1317   8.8%
(Missing)  48     0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    6605   44.2%
4.0    3888   26.0%
3.0    1823   12.2%
1.0    1319   8.8%
2.0    1317   8.8%

Most occurring characters

Value  Count  Frequency (%)
.      14952  33.3%
0      14952  33.3%
5      6605   14.7%
4      3888   8.7%
3      1823   4.1%
1      1319   2.9%
2      1317   2.9%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29904  66.7%
Other Punctuation  14952  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14952  50.0%
5      6605   22.1%
4      3888   13.0%
3      1823   6.1%
1      1319   4.4%
2      1317   4.4%
Other Punctuation
Value  Count  Frequency (%)
.      14952  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44856  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14952  33.3%
0      14952  33.3%
5      6605   14.7%
4      3888   8.7%
3      1823   4.1%
1      1319   2.9%
2      1317   2.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44856  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14952  33.3%
0      14952  33.3%
5      6605   14.7%
4      3888   8.7%
3      1823   4.1%
1      1319   2.9%
2      1317   2.9%

Q9
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 56
Missing (%): 0.4%
Memory size: 117.3 KiB

Value  Count
5.0    5627
4.0    4892
3.0    2620
2.0    1144
1.0    661

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44832
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1.0
2nd row: 3.0
3rd row: 4.0
4th row: 3.0
5th row: 3.0

Common Values

Value      Count  Frequency (%)
5.0        5627   37.5%
4.0        4892   32.6%
3.0        2620   17.5%
2.0        1144   7.6%
1.0        661    4.4%
(Missing)  56     0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    5627   37.7%
4.0    4892   32.7%
3.0    2620   17.5%
2.0    1144   7.7%
1.0    661    4.4%

Most occurring characters

Value  Count  Frequency (%)
.      14944  33.3%
0      14944  33.3%
5      5627   12.6%
4      4892   10.9%
3      2620   5.8%
2      1144   2.6%
1      661    1.5%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29888  66.7%
Other Punctuation  14944  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14944  50.0%
5      5627   18.8%
4      4892   16.4%
3      2620   8.8%
2      1144   3.8%
1      661    2.2%
Other Punctuation
Value  Count  Frequency (%)
.      14944  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44832  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14944  33.3%
0      14944  33.3%
5      5627   12.6%
4      4892   10.9%
3      2620   5.8%
2      1144   2.6%
1      661    1.5%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44832  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14944  33.3%
0      14944  33.3%
5      5627   12.6%
4      4892   10.9%
3      2620   5.8%
2      1144   2.6%
1      661    1.5%

Q10
Categorical

Distinct: 5
Distinct (%): < 0.1%
Missing: 72
Missing (%): 0.5%
Memory size: 117.3 KiB

Value  Count
5.0    6983
4.0    4265
3.0    2501
2.0    706
1.0    473

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44784
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3.0
2nd row: 3.0
3rd row: 4.0
4th row: 4.0
5th row: 4.0

Common Values

Value      Count  Frequency (%)
5.0        6983   46.6%
4.0        4265   28.4%
3.0        2501   16.7%
2.0        706    4.7%
1.0        473    3.2%
(Missing)  72     0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    6983   46.8%
4.0    4265   28.6%
3.0    2501   16.8%
2.0    706    4.7%
1.0    473    3.2%

Most occurring characters

Value  Count  Frequency (%)
.      14928  33.3%
0      14928  33.3%
5      6983   15.6%
4      4265   9.5%
3      2501   5.6%
2      706    1.6%
1      473    1.1%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29856  66.7%
Other Punctuation  14928  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14928  50.0%
5      6983   23.4%
4      4265   14.3%
3      2501   8.4%
2      706    2.4%
1      473    1.6%
Other Punctuation
Value  Count  Frequency (%)
.      14928  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44784  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14928  33.3%
0      14928  33.3%
5      6983   15.6%
4      4265   9.5%
3      2501   5.6%
2      706    1.6%
1      473    1.1%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44784  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14928  33.3%
0      14928  33.3%
5      6983   15.6%
4      4265   9.5%
3      2501   5.6%
2      706    1.6%
1      473    1.1%

Q11
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 59
Missing (%): 0.4%
Memory size: 117.3 KiB

Value  Count
5.0    4280
1.0    3863
4.0    2740
3.0    2328
2.0    1730

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44823
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.0
2nd row: 1.0
3rd row: 2.0
4th row: 5.0
5th row: 4.0

Common Values

Value      Count  Frequency (%)
5.0        4280   28.5%
1.0        3863   25.8%
4.0        2740   18.3%
3.0        2328   15.5%
2.0        1730   11.5%
(Missing)  59     0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    4280   28.6%
1.0    3863   25.9%
4.0    2740   18.3%
3.0    2328   15.6%
2.0    1730   11.6%

Most occurring characters

Value  Count  Frequency (%)
.      14941  33.3%
0      14941  33.3%
5      4280   9.5%
1      3863   8.6%
4      2740   6.1%
3      2328   5.2%
2      1730   3.9%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29882  66.7%
Other Punctuation  14941  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14941  50.0%
5      4280   14.3%
1      3863   12.9%
4      2740   9.2%
3      2328   7.8%
2      1730   5.8%
Other Punctuation
Value  Count  Frequency (%)
.      14941  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44823  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14941  33.3%
0      14941  33.3%
5      4280   9.5%
1      3863   8.6%
4      2740   6.1%
3      2328   5.2%
2      1730   3.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44823  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14941  33.3%
0      14941  33.3%
5      4280   9.5%
1      3863   8.6%
4      2740   6.1%
3      2328   5.2%
2      1730   3.9%

Q12
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 67
Missing (%): 0.4%
Memory size: 117.3 KiB

Value  Count
5.0    5421
4.0    4749
3.0    2181
2.0    1502
1.0    1080

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44799
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.0
2nd row: 4.0
3rd row: 5.0
4th row: 2.0
5th row: 4.0

Common Values

Value      Count  Frequency (%)
5.0        5421   36.1%
4.0        4749   31.7%
3.0        2181   14.5%
2.0        1502   10.0%
1.0        1080   7.2%
(Missing)  67     0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    5421   36.3%
4.0    4749   31.8%
3.0    2181   14.6%
2.0    1502   10.1%
1.0    1080   7.2%

Most occurring characters

Value  Count  Frequency (%)
.      14933  33.3%
0      14933  33.3%
5      5421   12.1%
4      4749   10.6%
3      2181   4.9%
2      1502   3.4%
1      1080   2.4%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29866  66.7%
Other Punctuation  14933  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14933  50.0%
5      5421   18.2%
4      4749   15.9%
3      2181   7.3%
2      1502   5.0%
1      1080   3.6%
Other Punctuation
Value  Count  Frequency (%)
.      14933  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44799  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14933  33.3%
0      14933  33.3%
5      5421   12.1%
4      4749   10.6%
3      2181   4.9%
2      1502   3.4%
1      1080   2.4%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44799  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14933  33.3%
0      14933  33.3%
5      5421   12.1%
4      4749   10.6%
3      2181   4.9%
2      1502   3.4%
1      1080   2.4%

Q13
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 40
Missing (%): 0.3%
Memory size: 117.3 KiB

Value  Count
5.0    5816
4.0    3226
3.0    2864
2.0    1743
1.0    1311

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 44880
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.0
2nd row: 5.0
3rd row: 5.0
4th row: 2.0
5th row: 3.0

Common Values

Value      Count  Frequency (%)
5.0        5816   38.8%
4.0        3226   21.5%
3.0        2864   19.1%
2.0        1743   11.6%
1.0        1311   8.7%
(Missing)  40     0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
5.0    5816   38.9%
4.0    3226   21.6%
3.0    2864   19.1%
2.0    1743   11.7%
1.0    1311   8.8%

Most occurring characters

Value  Count  Frequency (%)
.      14960  33.3%
0      14960  33.3%
5      5816   13.0%
4      3226   7.2%
3      2864   6.4%
2      1743   3.9%
1      1311   2.9%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     29920  66.7%
Other Punctuation  14960  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      14960  50.0%
5      5816   19.4%
4      3226   10.8%
3      2864   9.6%
2      1743   5.8%
1      1311   4.4%
Other Punctuation
Value  Count  Frequency (%)
.      14960  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44880  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
.      14960  33.3%
0      14960  33.3%
5      5816   13.0%
4      3226   7.2%
3      2864   6.4%
2      1743   3.9%
1      1311   2.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44880  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
.      14960  33.3%
0      14960  33.3%
5      5816   13.0%
4      3226   7.2%
3      2864   6.4%
2      1743   3.9%
1      1311   2.9%

Q14
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing36
Missing (%)0.2%
Memory size117.3 KiB
5.0
4796 
4.0
4332 
3.0
3423 
2.0
1730 
1.0
683 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44892
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row3.0
3rd row5.0
4th row4.0
5th row5.0

Common Values

Value | Count | Frequency (%)
5.0 | 4796 | 32.0%
4.0 | 4332 | 28.9%
3.0 | 3423 | 22.8%
2.0 | 1730 | 11.5%
1.0 | 683 | 4.6%
(Missing) | 36 | 0.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.04796
32.1%
4.04332
28.9%
3.03423
22.9%
2.01730
 
11.6%
1.0683
 
4.6%

Most occurring characters

ValueCountFrequency (%)
.14964
33.3%
014964
33.3%
54796
 
10.7%
44332
 
9.6%
33423
 
7.6%
21730
 
3.9%
1683
 
1.5%
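The character table for Q14 can be reproduced from the value counts alone, since every value is a three-character string such as "5.0". The sketch below is an illustration under that assumption, not the report generator's code. It expands each value into its characters, weighted by how often the value occurs.

```python
from collections import Counter

# Q14 value counts as listed in the report.
value_counts = {"5.0": 4796, "4.0": 4332, "3.0": 3423, "2.0": 1730, "1.0": 683}

char_counts = Counter()
for value, n in value_counts.items():
    for ch in value:
        char_counts[ch] += n  # each occurrence of the value contributes every character once

total_chars = sum(char_counts.values())
for ch, n in char_counts.most_common():
    print(repr(ch), n, f"{100 * n / total_chars:.1f}%")
```

The full stop and the trailing zero each occur once per non-missing row, which is why both sit at one third of all characters.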

Most occurring categories

ValueCountFrequency (%)
Decimal Number29928
66.7%
Other Punctuation14964
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014964
50.0%
54796
 
16.0%
44332
 
14.5%
33423
 
11.4%
21730
 
5.8%
1683
 
2.3%
Other Punctuation
ValueCountFrequency (%)
.14964
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44892
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14964
33.3%
014964
33.3%
54796
 
10.7%
44332
 
9.6%
33423
 
7.6%
21730
 
3.9%
1683
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII44892
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14964
33.3%
014964
33.3%
54796
 
10.7%
44332
 
9.6%
33423
 
7.6%
21730
 
3.9%
1683
 
1.5%

Q15
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing45
Missing (%)0.3%
Memory size117.3 KiB
1.0
3775 
5.0
3476 
4.0
3115 
3.0
2388 
2.0
2201 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44865
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row1.0
3rd row1.0
4th row4.0
5th row5.0

Common Values

Value | Count | Frequency (%)
1.0 | 3775 | 25.2%
5.0 | 3476 | 23.2%
4.0 | 3115 | 20.8%
3.0 | 2388 | 15.9%
2.0 | 2201 | 14.7%
(Missing) | 45 | 0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.03775
25.2%
5.03476
23.2%
4.03115
20.8%
3.02388
16.0%
2.02201
14.7%

Most occurring characters

ValueCountFrequency (%)
.14955
33.3%
014955
33.3%
13775
 
8.4%
53476
 
7.7%
43115
 
6.9%
32388
 
5.3%
22201
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29910
66.7%
Other Punctuation14955
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014955
50.0%
13775
 
12.6%
53476
 
11.6%
43115
 
10.4%
32388
 
8.0%
22201
 
7.4%
Other Punctuation
ValueCountFrequency (%)
.14955
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44865
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14955
33.3%
014955
33.3%
13775
 
8.4%
53476
 
7.7%
43115
 
6.9%
32388
 
5.3%
22201
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII44865
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14955
33.3%
014955
33.3%
13775
 
8.4%
53476
 
7.7%
43115
 
6.9%
32388
 
5.3%
22201
 
4.9%

Q16
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing33
Missing (%)0.2%
Memory size117.3 KiB
5.0
4807 
3.0
3028 
4.0
2965 
2.0
2245 
1.0
1922 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44901
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row2.0
3rd row3.0
4th row2.0
5th row2.0

Common Values

Value | Count | Frequency (%)
5.0 | 4807 | 32.0%
3.0 | 3028 | 20.2%
4.0 | 2965 | 19.8%
2.0 | 2245 | 15.0%
1.0 | 1922 | 12.8%
(Missing) | 33 | 0.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.04807
32.1%
3.03028
20.2%
4.02965
19.8%
2.02245
15.0%
1.01922
 
12.8%

Most occurring characters

ValueCountFrequency (%)
.14967
33.3%
014967
33.3%
54807
 
10.7%
33028
 
6.7%
42965
 
6.6%
22245
 
5.0%
11922
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29934
66.7%
Other Punctuation14967
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014967
50.0%
54807
 
16.1%
33028
 
10.1%
42965
 
9.9%
22245
 
7.5%
11922
 
6.4%
Other Punctuation
ValueCountFrequency (%)
.14967
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44901
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14967
33.3%
014967
33.3%
54807
 
10.7%
33028
 
6.7%
42965
 
6.6%
22245
 
5.0%
11922
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII44901
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14967
33.3%
014967
33.3%
54807
 
10.7%
33028
 
6.7%
42965
 
6.6%
22245
 
5.0%
11922
 
4.3%

Q17
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing37
Missing (%)0.2%
Memory size117.3 KiB
5.0
6138 
4.0
4346 
3.0
2509 
2.0
1400 
1.0
 
570

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44889
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row4.0
3rd row5.0
4th row4.0
5th row4.0

Common Values

Value | Count | Frequency (%)
5.0 | 6138 | 40.9%
4.0 | 4346 | 29.0%
3.0 | 2509 | 16.7%
2.0 | 1400 | 9.3%
1.0 | 570 | 3.8%
(Missing) | 37 | 0.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.06138
41.0%
4.04346
29.0%
3.02509
16.8%
2.01400
 
9.4%
1.0570
 
3.8%

Most occurring characters

ValueCountFrequency (%)
.14963
33.3%
014963
33.3%
56138
13.7%
44346
 
9.7%
32509
 
5.6%
21400
 
3.1%
1570
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29926
66.7%
Other Punctuation14963
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014963
50.0%
56138
20.5%
44346
 
14.5%
32509
 
8.4%
21400
 
4.7%
1570
 
1.9%
Other Punctuation
ValueCountFrequency (%)
.14963
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44889
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14963
33.3%
014963
33.3%
56138
13.7%
44346
 
9.7%
32509
 
5.6%
21400
 
3.1%
1570
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII44889
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14963
33.3%
014963
33.3%
56138
13.7%
44346
 
9.7%
32509
 
5.6%
21400
 
3.1%
1570
 
1.3%

Q18
Categorical

Distinct5
Distinct (%)< 0.1%
Missing63
Missing (%)0.4%
Memory size117.3 KiB
5.0
7010 
4.0
3437 
3.0
1969 
1.0
1441 
2.0
1080 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44811
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row5.0
3rd row3.0
4th row5.0
5th row1.0

Common Values

Value | Count | Frequency (%)
5.0 | 7010 | 46.7%
4.0 | 3437 | 22.9%
3.0 | 1969 | 13.1%
1.0 | 1441 | 9.6%
2.0 | 1080 | 7.2%
(Missing) | 63 | 0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.07010
46.9%
4.03437
23.0%
3.01969
 
13.2%
1.01441
 
9.6%
2.01080
 
7.2%

Most occurring characters

ValueCountFrequency (%)
.14937
33.3%
014937
33.3%
57010
15.6%
43437
 
7.7%
31969
 
4.4%
11441
 
3.2%
21080
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29874
66.7%
Other Punctuation14937
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014937
50.0%
57010
23.5%
43437
 
11.5%
31969
 
6.6%
11441
 
4.8%
21080
 
3.6%
Other Punctuation
ValueCountFrequency (%)
.14937
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44811
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14937
33.3%
014937
33.3%
57010
15.6%
43437
 
7.7%
31969
 
4.4%
11441
 
3.2%
21080
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII44811
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14937
33.3%
014937
33.3%
57010
15.6%
43437
 
7.7%
31969
 
4.4%
11441
 
3.2%
21080
 
2.4%

Q19
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing53
Missing (%)0.4%
Memory size117.3 KiB
5.0
5413 
1.0
2901 
4.0
2725 
2.0
1983 
3.0
1925 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44841
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row1.0
3rd row5.0
4th row4.0
5th row4.0

Common Values

Value | Count | Frequency (%)
5.0 | 5413 | 36.1%
1.0 | 2901 | 19.3%
4.0 | 2725 | 18.2%
2.0 | 1983 | 13.2%
3.0 | 1925 | 12.8%
(Missing) | 53 | 0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.05413
36.2%
1.02901
19.4%
4.02725
18.2%
2.01983
 
13.3%
3.01925
 
12.9%

Most occurring characters

ValueCountFrequency (%)
.14947
33.3%
014947
33.3%
55413
 
12.1%
12901
 
6.5%
42725
 
6.1%
21983
 
4.4%
31925
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29894
66.7%
Other Punctuation14947
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014947
50.0%
55413
 
18.1%
12901
 
9.7%
42725
 
9.1%
21983
 
6.6%
31925
 
6.4%
Other Punctuation
ValueCountFrequency (%)
.14947
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44841
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14947
33.3%
014947
33.3%
55413
 
12.1%
12901
 
6.5%
42725
 
6.1%
21983
 
4.4%
31925
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII44841
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14947
33.3%
014947
33.3%
55413
 
12.1%
12901
 
6.5%
42725
 
6.1%
21983
 
4.4%
31925
 
4.3%

Q20
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing45
Missing (%)0.3%
Memory size117.3 KiB
5.0
4698 
4.0
3891 
3.0
3877 
2.0
1771 
1.0
718 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44865
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row3.0
3rd row2.0
4th row3.0
5th row2.0

Common Values

Value | Count | Frequency (%)
5.0 | 4698 | 31.3%
4.0 | 3891 | 25.9%
3.0 | 3877 | 25.8%
2.0 | 1771 | 11.8%
1.0 | 718 | 4.8%
(Missing) | 45 | 0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.04698
31.4%
4.03891
26.0%
3.03877
25.9%
2.01771
 
11.8%
1.0718
 
4.8%

Most occurring characters

ValueCountFrequency (%)
.14955
33.3%
014955
33.3%
54698
 
10.5%
43891
 
8.7%
33877
 
8.6%
21771
 
3.9%
1718
 
1.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29910
66.7%
Other Punctuation14955
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014955
50.0%
54698
 
15.7%
43891
 
13.0%
33877
 
13.0%
21771
 
5.9%
1718
 
2.4%
Other Punctuation
ValueCountFrequency (%)
.14955
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44865
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14955
33.3%
014955
33.3%
54698
 
10.5%
43891
 
8.7%
33877
 
8.6%
21771
 
3.9%
1718
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII44865
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14955
33.3%
014955
33.3%
54698
 
10.5%
43891
 
8.7%
33877
 
8.6%
21771
 
3.9%
1718
 
1.6%

Q21
Categorical

Distinct5
Distinct (%)< 0.1%
Missing39
Missing (%)0.3%
Memory size117.3 KiB
1.0
4591 
5.0
3893 
4.0
2882 
2.0
1959 
3.0
1636 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44883
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row1.0
3rd row2.0
4th row3.0
5th row4.0

Common Values

Value | Count | Frequency (%)
1.0 | 4591 | 30.6%
5.0 | 3893 | 26.0%
4.0 | 2882 | 19.2%
2.0 | 1959 | 13.1%
3.0 | 1636 | 10.9%
(Missing) | 39 | 0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.04591
30.7%
5.03893
26.0%
4.02882
19.3%
2.01959
13.1%
3.01636
 
10.9%

Most occurring characters

ValueCountFrequency (%)
.14961
33.3%
014961
33.3%
14591
 
10.2%
53893
 
8.7%
42882
 
6.4%
21959
 
4.4%
31636
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29922
66.7%
Other Punctuation14961
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014961
50.0%
14591
 
15.3%
53893
 
13.0%
42882
 
9.6%
21959
 
6.5%
31636
 
5.5%
Other Punctuation
ValueCountFrequency (%)
.14961
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44883
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14961
33.3%
014961
33.3%
14591
 
10.2%
53893
 
8.7%
42882
 
6.4%
21959
 
4.4%
31636
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII44883
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14961
33.3%
014961
33.3%
14591
 
10.2%
53893
 
8.7%
42882
 
6.4%
21959
 
4.4%
31636
 
3.6%

Q22
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing38
Missing (%)0.3%
Memory size117.3 KiB
1.0
5272 
2.0
3560 
3.0
2560 
4.0
1821 
5.0
1749 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44886
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row1.0
3rd row1.0
4th row4.0
5th row2.0

Common Values

Value | Count | Frequency (%)
1.0 | 5272 | 35.1%
2.0 | 3560 | 23.7%
3.0 | 2560 | 17.1%
4.0 | 1821 | 12.1%
5.0 | 1749 | 11.7%
(Missing) | 38 | 0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.05272
35.2%
2.03560
23.8%
3.02560
17.1%
4.01821
 
12.2%
5.01749
 
11.7%

Most occurring characters

ValueCountFrequency (%)
.14962
33.3%
014962
33.3%
15272
 
11.7%
23560
 
7.9%
32560
 
5.7%
41821
 
4.1%
51749
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29924
66.7%
Other Punctuation14962
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014962
50.0%
15272
 
17.6%
23560
 
11.9%
32560
 
8.6%
41821
 
6.1%
51749
 
5.8%
Other Punctuation
ValueCountFrequency (%)
.14962
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44886
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14962
33.3%
014962
33.3%
15272
 
11.7%
23560
 
7.9%
32560
 
5.7%
41821
 
4.1%
51749
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII44886
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14962
33.3%
014962
33.3%
15272
 
11.7%
23560
 
7.9%
32560
 
5.7%
41821
 
4.1%
51749
 
3.9%

Q23
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing50
Missing (%)0.3%
Memory size117.3 KiB
5.0
6756 
4.0
3498 
3.0
1708 
1.0
1532 
2.0
1456 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44850
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row5.0
3rd row2.0
4th row3.0
5th row3.0

Common Values

Value | Count | Frequency (%)
5.0 | 6756 | 45.0%
4.0 | 3498 | 23.3%
3.0 | 1708 | 11.4%
1.0 | 1532 | 10.2%
2.0 | 1456 | 9.7%
(Missing) | 50 | 0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.06756
45.2%
4.03498
23.4%
3.01708
 
11.4%
1.01532
 
10.2%
2.01456
 
9.7%

Most occurring characters

ValueCountFrequency (%)
.14950
33.3%
014950
33.3%
56756
15.1%
43498
 
7.8%
31708
 
3.8%
11532
 
3.4%
21456
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29900
66.7%
Other Punctuation14950
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014950
50.0%
56756
22.6%
43498
 
11.7%
31708
 
5.7%
11532
 
5.1%
21456
 
4.9%
Other Punctuation
ValueCountFrequency (%)
.14950
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44850
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14950
33.3%
014950
33.3%
56756
15.1%
43498
 
7.8%
31708
 
3.8%
11532
 
3.4%
21456
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII44850
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14950
33.3%
014950
33.3%
56756
15.1%
43498
 
7.8%
31708
 
3.8%
11532
 
3.4%
21456
 
3.2%

Q24
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing61
Missing (%)0.4%
Memory size117.3 KiB
5.0
7792 
4.0
4896 
3.0
1433 
2.0
 
528
1.0
 
290

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44817
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row3.0
3rd row4.0
4th row4.0
5th row4.0

Common Values

Value | Count | Frequency (%)
5.0 | 7792 | 51.9%
4.0 | 4896 | 32.6%
3.0 | 1433 | 9.6%
2.0 | 528 | 3.5%
1.0 | 290 | 1.9%
(Missing) | 61 | 0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.07792
52.2%
4.04896
32.8%
3.01433
 
9.6%
2.0528
 
3.5%
1.0290
 
1.9%

Most occurring characters

ValueCountFrequency (%)
.14939
33.3%
014939
33.3%
57792
17.4%
44896
 
10.9%
31433
 
3.2%
2528
 
1.2%
1290
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29878
66.7%
Other Punctuation14939
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014939
50.0%
57792
26.1%
44896
 
16.4%
31433
 
4.8%
2528
 
1.8%
1290
 
1.0%
Other Punctuation
ValueCountFrequency (%)
.14939
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44817
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14939
33.3%
014939
33.3%
57792
17.4%
44896
 
10.9%
31433
 
3.2%
2528
 
1.2%
1290
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII44817
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14939
33.3%
014939
33.3%
57792
17.4%
44896
 
10.9%
31433
 
3.2%
2528
 
1.2%
1290
 
0.6%

Q25
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing44
Missing (%)0.3%
Memory size117.3 KiB
4.0
3782 
5.0
3080 
3.0
2875 
2.0
2761 
1.0
2458 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44868
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row2.0
3rd row2.0
4th row4.0
5th row4.0

Common Values

Value | Count | Frequency (%)
4.0 | 3782 | 25.2%
5.0 | 3080 | 20.5%
3.0 | 2875 | 19.2%
2.0 | 2761 | 18.4%
1.0 | 2458 | 16.4%
(Missing) | 44 | 0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
4.03782
25.3%
5.03080
20.6%
3.02875
19.2%
2.02761
18.5%
1.02458
16.4%

Most occurring characters

ValueCountFrequency (%)
.14956
33.3%
014956
33.3%
43782
 
8.4%
53080
 
6.9%
32875
 
6.4%
22761
 
6.2%
12458
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29912
66.7%
Other Punctuation14956
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014956
50.0%
43782
 
12.6%
53080
 
10.3%
32875
 
9.6%
22761
 
9.2%
12458
 
8.2%
Other Punctuation
ValueCountFrequency (%)
.14956
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44868
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14956
33.3%
014956
33.3%
43782
 
8.4%
53080
 
6.9%
32875
 
6.4%
22761
 
6.2%
12458
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII44868
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14956
33.3%
014956
33.3%
43782
 
8.4%
53080
 
6.9%
32875
 
6.4%
22761
 
6.2%
12458
 
5.5%

Q26
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing68
Missing (%)0.5%
Memory size117.3 KiB
5.0
7271 
4.0
4465 
3.0
1918 
2.0
892 
1.0
 
386

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44796
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row5.0
3rd row5.0
4th row2.0
5th row4.0

Common Values

Value | Count | Frequency (%)
5.0 | 7271 | 48.5%
4.0 | 4465 | 29.8%
3.0 | 1918 | 12.8%
2.0 | 892 | 5.9%
1.0 | 386 | 2.6%
(Missing) | 68 | 0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
5.07271
48.7%
4.04465
29.9%
3.01918
 
12.8%
2.0892
 
6.0%
1.0386
 
2.6%

Most occurring characters

ValueCountFrequency (%)
.14932
33.3%
014932
33.3%
57271
16.2%
44465
 
10.0%
31918
 
4.3%
2892
 
2.0%
1386
 
0.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29864
66.7%
Other Punctuation14932
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014932
50.0%
57271
24.3%
44465
 
15.0%
31918
 
6.4%
2892
 
3.0%
1386
 
1.3%
Other Punctuation
ValueCountFrequency (%)
.14932
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44796
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14932
33.3%
014932
33.3%
57271
16.2%
44465
 
10.0%
31918
 
4.3%
2892
 
2.0%
1386
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII44796
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14932
33.3%
014932
33.3%
57271
16.2%
44465
 
10.0%
31918
 
4.3%
2892
 
2.0%
1386
 
0.9%

country
Categorical

HIGH CARDINALITY
MISSING

Distinct137
Distinct (%)0.9%
Missing190
Missing (%)1.3%
Memory size117.3 KiB
USA
7419 
GBR
1109 
CAN
915 
AUS
 
525
DEU
 
473
Other values (132)
4369 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44430
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)0.2%

Sample

1st rowUSA
2nd rowUSA
3rd rowNLD
4th rowUSA
5th rowITA

Common Values

Value | Count | Frequency (%)
USA | 7419 | 49.5%
GBR | 1109 | 7.4%
CAN | 915 | 6.1%
AUS | 525 | 3.5%
DEU | 473 | 3.2%
PHL | 261 | 1.7%
BRA | 240 | 1.6%
IND | 233 | 1.6%
POL | 210 | 1.4%
FRA | 208 | 1.4%
Other values (127) | 3217 | 21.4%
(Missing) | 190 | 1.3%
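The country summary shows "Other values (132)" with 4369 rows after the top five codes, while the table of the ten most common codes shows "Other values (127)" with 3217 rows. Both follow from the same totals. The sketch below recomputes the two buckets from the figures in the report. The function name `other_bucket` is an illustrative assumption.

```python
# The ten most common country codes and their counts, as listed in the report.
top10 = [
    ("USA", 7419), ("GBR", 1109), ("CAN", 915), ("AUS", 525), ("DEU", 473),
    ("PHL", 261), ("BRA", 240), ("IND", 233), ("POL", 210), ("FRA", 208),
]
distinct = 137               # distinct country codes
non_missing = 15000 - 190    # observations minus missing values


def other_bucket(k):
    """Return (number of remaining categories, their total count) after the top k."""
    shown_total = sum(count for _, count in top10[:k])
    return distinct - k, non_missing - shown_total


print(other_bucket(5))    # bucket after the top five codes
print(other_bucket(10))   # bucket after the top ten codes
```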

Length

Histogram of lengths of the category
ValueCountFrequency (%)
usa7419
50.1%
gbr1109
 
7.5%
can915
 
6.2%
aus525
 
3.5%
deu473
 
3.2%
phl261
 
1.8%
bra240
 
1.6%
ind233
 
1.6%
pol210
 
1.4%
fra208
 
1.4%
Other values (127)3217
21.7%

Most occurring characters

ValueCountFrequency (%)
A9845
22.2%
U8829
19.9%
S8648
19.5%
R2484
 
5.6%
N2112
 
4.8%
B1519
 
3.4%
G1446
 
3.3%
C1176
 
2.6%
E1128
 
2.5%
D1117
 
2.5%
Other values (16)6126
13.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter44430
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A9845
22.2%
U8829
19.9%
S8648
19.5%
R2484
 
5.6%
N2112
 
4.8%
B1519
 
3.4%
G1446
 
3.3%
C1176
 
2.6%
E1128
 
2.5%
D1117
 
2.5%
Other values (16)6126
13.8%

Most occurring scripts

ValueCountFrequency (%)
Latin44430
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A9845
22.2%
U8829
19.9%
S8648
19.5%
R2484
 
5.6%
N2112
 
4.8%
B1519
 
3.4%
G1446
 
3.3%
C1176
 
2.6%
E1128
 
2.5%
D1117
 
2.5%
Other values (16)6126
13.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII44430
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A9845
22.2%
U8829
19.9%
S8648
19.5%
R2484
 
5.6%
N2112
 
4.8%
B1519
 
3.4%
G1446
 
3.3%
C1176
 
2.6%
E1128
 
2.5%
D1117
 
2.5%
Other values (16)6126
13.8%

introelapse
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct1315
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean767.1377333
Minimum1
Maximum855030
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size117.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 4
median: 10
Q3: 36
95-th percentile: 976
Maximum: 855030
Range: 855029
Interquartile range (IQR): 32

Descriptive statistics

Standard deviation13835.94804
Coefficient of variation (CV)18.03580692
Kurtosis2797.753594
Mean767.1377333
Median Absolute Deviation (MAD)8
Skewness48.40398766
Sum11507066
Variance191433458.1
MonotonicityNot monotonic
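Several of the introelapse statistics are derived from others: the range from the extremes, the interquartile range from the quartiles (Q1 = 4, Q3 = 36), the coefficient of variation from the standard deviation and mean, and the variance from the standard deviation. A minimal sketch that recomputes them from the reported values, using the standard definitions and illustrative variable names:

```python
# Values as reported for introelapse.
mean = 767.1377333
std = 13835.94804
minimum, maximum = 1, 855030
q1, median, q3 = 4, 10, 36

value_range = maximum - minimum   # reported as 855029
iqr = q3 - q1                     # reported as 32
cv = std / mean                   # reported as 18.03580692
variance = std ** 2               # reported as 191433458.1

print(value_range, iqr, round(cv, 4), round(variance, 1))
# A mean far above the median is one sign of the strong right skew flagged by the report.
print(mean / median)
```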
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
2 | 1656 | 11.0%
3 | 1458 | 9.7%
4 | 1030 | 6.9%
5 | 691 | 4.6%
6 | 572 | 3.8%
7 | 485 | 3.2%
8 | 463 | 3.1%
1 | 429 | 2.9%
9 | 429 | 2.9%
10 | 376 | 2.5%
Other values (1305) | 7411 | 49.4%
Value | Count | Frequency (%)
1 | 429 | 2.9%
2 | 1656 | 11.0%
3 | 1458 | 9.7%
4 | 1030 | 6.9%
5 | 691 | 4.6%
6 | 572 | 3.8%
7 | 485 | 3.2%
8 | 463 | 3.1%
9 | 429 | 2.9%
10 | 376 | 2.5%
Value | Count | Frequency (%)
855030 | 2 | < 0.1%
817147 | 1 | < 0.1%
285418 | 2 | < 0.1%
258417 | 2 | < 0.1%
233095 | 1 | < 0.1%
190936 | 1 | < 0.1%
148457 | 2 | < 0.1%
135006 | 1 | < 0.1%
133690 | 1 | < 0.1%
123210 | 2 | < 0.1%

testelapse
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct684
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean387.9656667
Minimum1
Maximum474572
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size117.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 60
Q1: 82
median: 106
Q3: 140
95-th percentile: 290
Maximum: 474572
Range: 474571
Interquartile range (IQR): 58

Descriptive statistics

Standard deviation8513.03161
Coefficient of variation (CV)21.94274479
Kurtosis2135.513955
Mean387.9656667
Median Absolute Deviation (MAD)27
Skewness44.46473054
Sum5819485
Variance72471707.2
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
82 | 187 | 1.2%
80 | 179 | 1.2%
104 | 177 | 1.2%
87 | 171 | 1.1%
106 | 170 | 1.1%
94 | 169 | 1.1%
98 | 169 | 1.1%
95 | 167 | 1.1%
74 | 166 | 1.1%
89 | 166 | 1.1%
Other values (674) | 13279 | 88.5%
Value | Count | Frequency (%)
1 | 1 | < 0.1%
4 | 2 | < 0.1%
25 | 1 | < 0.1%
30 | 4 | < 0.1%
32 | 3 | < 0.1%
33 | 1 | < 0.1%
34 | 1 | < 0.1%
35 | 2 | < 0.1%
36 | 1 | < 0.1%
37 | 3 | < 0.1%
Value | Count | Frequency (%)
474572 | 2 | < 0.1%
407208 | 1 | < 0.1%
374744 | 1 | < 0.1%
285399 | 2 | < 0.1%
240022 | 2 | < 0.1%
127962 | 1 | < 0.1%
92594 | 1 | < 0.1%
92462 | 1 | < 0.1%
71644 | 1 | < 0.1%
61544 | 1 | < 0.1%

surveyelapse
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct712
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2787.958533
Minimum3
Maximum15166994
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size117.3 KiB

Quantile statistics

Minimum: 3
5-th percentile: 89
Q1: 126
median: 164
Q3: 217
95-th percentile: 381
Maximum: 15166994
Range: 15166991
Interquartile range (IQR): 91

Descriptive statistics

Standard deviation178595.4557
Coefficient of variation (CV)64.05958109
Kurtosis6941.824191
Mean2787.958533
Median Absolute Deviation (MAD)43
Skewness82.28836108
Sum41819378
Variance3.189633681 × 10^10
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
161 | 130 | 0.9%
142 | 128 | 0.9%
115 | 118 | 0.8%
143 | 116 | 0.8%
154 | 116 | 0.8%
146 | 115 | 0.8%
151 | 114 | 0.8%
118 | 112 | 0.7%
139 | 111 | 0.7%
133 | 110 | 0.7%
Other values (702) | 13830 | 92.2%
Value | Count | Frequency (%)
3 | 1 | < 0.1%
4 | 4 | < 0.1%
5 | 5 | < 0.1%
6 | 5 | < 0.1%
8 | 3 | < 0.1%
9 | 2 | < 0.1%
10 | 1 | < 0.1%
11 | 2 | < 0.1%
13 | 1 | < 0.1%
23 | 1 | < 0.1%
Value | Count | Frequency (%)
15166994 | 2 | < 0.1%
3420008 | 1 | < 0.1%
2513542 | 1 | < 0.1%
474940 | 1 | < 0.1%
195932 | 1 | < 0.1%
175836 | 2 | < 0.1%
173410 | 1 | < 0.1%
134393 | 1 | < 0.1%
89824 | 1 | < 0.1%
80810 | 1 | < 0.1%

TIPI1
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing53
Missing (%)0.4%
Memory size117.3 KiB
3.0
5414 
2.0
3451 
1.0
3098 
4.0
1954 
5.0
1030 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44841
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4.0
2nd row4.0
3rd row1.0
4th row3.0
5th row3.0

Common Values

Value | Count | Frequency (%)
3.0 | 5414 | 36.1%
2.0 | 3451 | 23.0%
1.0 | 3098 | 20.7%
4.0 | 1954 | 13.0%
5.0 | 1030 | 6.9%
(Missing) | 53 | 0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.05414
36.2%
2.03451
23.1%
1.03098
20.7%
4.01954
 
13.1%
5.01030
 
6.9%

Most occurring characters

ValueCountFrequency (%)
.14947
33.3%
014947
33.3%
35414
 
12.1%
23451
 
7.7%
13098
 
6.9%
41954
 
4.4%
51030
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29894
66.7%
Other Punctuation14947
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014947
50.0%
35414
 
18.1%
23451
 
11.5%
13098
 
10.4%
41954
 
6.5%
51030
 
3.4%
Other Punctuation
ValueCountFrequency (%)
.14947
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44841
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14947
33.3%
014947
33.3%
35414
 
12.1%
23451
 
7.7%
13098
 
6.9%
41954
 
4.4%
51030
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII44841
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14947
33.3%
014947
33.3%
35414
 
12.1%
23451
 
7.7%
13098
 
6.9%
41954
 
4.4%
51030
 
2.3%

TIPI2
Categorical

Distinct5
Distinct (%)< 0.1%
Missing66
Missing (%)0.4%
Memory size117.3 KiB
3.0
7062 
4.0
2932 
2.0
1969 
1.0
1586 
5.0
1385 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44802
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3.0
2nd row2.0
3rd row2.0
4th row3.0
5th row3.0

Common Values

Value | Count | Frequency (%)
3.0 | 7062 | 47.1%
4.0 | 2932 | 19.5%
2.0 | 1969 | 13.1%
1.0 | 1586 | 10.6%
5.0 | 1385 | 9.2%
(Missing) | 66 | 0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.07062
47.3%
4.02932
19.6%
2.01969
 
13.2%
1.01586
 
10.6%
5.01385
 
9.3%

Most occurring characters

ValueCountFrequency (%)
.14934
33.3%
014934
33.3%
37062
15.8%
42932
 
6.5%
21969
 
4.4%
11586
 
3.5%
51385
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29868
66.7%
Other Punctuation14934
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014934
50.0%
37062
23.6%
42932
 
9.8%
21969
 
6.6%
11586
 
5.3%
51385
 
4.6%
Other Punctuation
ValueCountFrequency (%)
.14934
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44802
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14934
33.3%
014934
33.3%
37062
15.8%
42932
 
6.5%
21969
 
4.4%
11586
 
3.5%
51385
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII44802
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14934
33.3%
014934
33.3%
37062
15.8%
42932
 
6.5%
21969
 
4.4%
11586
 
3.5%
51385
 
3.1%

TIPI3
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing79
Missing (%)0.5%
Memory size117.3 KiB
3.0
6500 
4.0
4020 
5.0
2644 
2.0
1161 
1.0
 
596

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44763
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row3.0
3rd row3.0
4th row3.0
5th row4.0

Common Values

Value      Count  Frequency (%)
3.0        6500   43.3%
4.0        4020   26.8%
5.0        2644   17.6%
2.0        1161   7.7%
1.0        596    4.0%
(Missing)  79     0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.06500
43.6%
4.04020
26.9%
5.02644
17.7%
2.01161
 
7.8%
1.0596
 
4.0%

Most occurring characters

ValueCountFrequency (%)
.14921
33.3%
014921
33.3%
36500
14.5%
44020
 
9.0%
52644
 
5.9%
21161
 
2.6%
1596
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29842
66.7%
Other Punctuation14921
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014921
50.0%
36500
21.8%
44020
 
13.5%
52644
 
8.9%
21161
 
3.9%
1596
 
2.0%
Other Punctuation
ValueCountFrequency (%)
.14921
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44763
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14921
33.3%
014921
33.3%
36500
14.5%
44020
 
9.0%
52644
 
5.9%
21161
 
2.6%
1596
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII44763
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14921
33.3%
014921
33.3%
36500
14.5%
44020
 
9.0%
52644
 
5.9%
21161
 
2.6%
1596
 
1.3%

TIPI4
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing64
Missing (%)0.4%
Memory size117.3 KiB
3.0
5923 
4.0
2993 
5.0
2849 
2.0
1763 
1.0
1408 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44808
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row5.0
3rd row1.0
4th row4.0
5th row4.0

Common Values

Value      Count  Frequency (%)
3.0        5923   39.5%
4.0        2993   20.0%
5.0        2849   19.0%
2.0        1763   11.8%
1.0        1408   9.4%
(Missing)  64     0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.05923
39.7%
4.02993
20.0%
5.02849
19.1%
2.01763
 
11.8%
1.01408
 
9.4%

Most occurring characters

ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
35923
 
13.2%
42993
 
6.7%
52849
 
6.4%
21763
 
3.9%
11408
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29872
66.7%
Other Punctuation14936
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014936
50.0%
35923
 
19.8%
42993
 
10.0%
52849
 
9.5%
21763
 
5.9%
11408
 
4.7%
Other Punctuation
ValueCountFrequency (%)
.14936
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44808
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
35923
 
13.2%
42993
 
6.7%
52849
 
6.4%
21763
 
3.9%
11408
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII44808
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
35923
 
13.2%
42993
 
6.7%
52849
 
6.4%
21763
 
3.9%
11408
 
3.1%

TIPI5
Categorical

Distinct5
Distinct (%)< 0.1%
Missing70
Missing (%)0.5%
Memory size117.3 KiB
3.0
5137 
4.0
4653 
5.0
4565 
2.0
 
447
1.0
 
128

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44790
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3.0
2nd row3.0
3rd row5.0
4th row5.0
5th row4.0

Common Values

Value      Count  Frequency (%)
3.0        5137   34.2%
4.0        4653   31.0%
5.0        4565   30.4%
2.0        447    3.0%
1.0        128    0.9%
(Missing)  70     0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.05137
34.4%
4.04653
31.2%
5.04565
30.6%
2.0447
 
3.0%
1.0128
 
0.9%

Most occurring characters

ValueCountFrequency (%)
.14930
33.3%
014930
33.3%
35137
 
11.5%
44653
 
10.4%
54565
 
10.2%
2447
 
1.0%
1128
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29860
66.7%
Other Punctuation14930
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014930
50.0%
35137
 
17.2%
44653
 
15.6%
54565
 
15.3%
2447
 
1.5%
1128
 
0.4%
Other Punctuation
ValueCountFrequency (%)
.14930
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44790
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14930
33.3%
014930
33.3%
35137
 
11.5%
44653
 
10.4%
54565
 
10.2%
2447
 
1.0%
1128
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII44790
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14930
33.3%
014930
33.3%
35137
 
11.5%
44653
 
10.4%
54565
 
10.2%
2447
 
1.0%
1128
 
0.3%

TIPI6
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing62
Missing (%)0.4%
Memory size117.3 KiB
3.0
5264 
5.0
4483 
4.0
3571 
2.0
895 
1.0
725 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44814
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row2.0
3rd row5.0
4th row3.0
5th row4.0

Common Values

Value      Count  Frequency (%)
3.0        5264   35.1%
5.0        4483   29.9%
4.0        3571   23.8%
2.0        895    6.0%
1.0        725    4.8%
(Missing)  62     0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.05264
35.2%
5.04483
30.0%
4.03571
23.9%
2.0895
 
6.0%
1.0725
 
4.9%

Most occurring characters

ValueCountFrequency (%)
.14938
33.3%
014938
33.3%
35264
 
11.7%
54483
 
10.0%
43571
 
8.0%
2895
 
2.0%
1725
 
1.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29876
66.7%
Other Punctuation14938
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014938
50.0%
35264
 
17.6%
54483
 
15.0%
43571
 
12.0%
2895
 
3.0%
1725
 
2.4%
Other Punctuation
ValueCountFrequency (%)
.14938
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44814
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14938
33.3%
014938
33.3%
35264
 
11.7%
54483
 
10.0%
43571
 
8.0%
2895
 
2.0%
1725
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII44814
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14938
33.3%
014938
33.3%
35264
 
11.7%
54483
 
10.0%
43571
 
8.0%
2895
 
2.0%
1725
 
1.6%

TIPI7
Categorical

Distinct5
Distinct (%)< 0.1%
Missing64
Missing (%)0.4%
Memory size117.3 KiB
3.0
6403 
4.0
3793 
5.0
3134 
2.0
1055 
1.0
 
551

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44808
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row5.0
3rd row3.0
4th row4.0
5th row3.0

Common Values

Value      Count  Frequency (%)
3.0        6403   42.7%
4.0        3793   25.3%
5.0        3134   20.9%
2.0        1055   7.0%
1.0        551    3.7%
(Missing)  64     0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.06403
42.9%
4.03793
25.4%
5.03134
21.0%
2.01055
 
7.1%
1.0551
 
3.7%

Most occurring characters

ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
36403
14.3%
43793
 
8.5%
53134
 
7.0%
21055
 
2.4%
1551
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29872
66.7%
Other Punctuation14936
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014936
50.0%
36403
21.4%
43793
 
12.7%
53134
 
10.5%
21055
 
3.5%
1551
 
1.8%
Other Punctuation
ValueCountFrequency (%)
.14936
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44808
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
36403
14.3%
43793
 
8.5%
53134
 
7.0%
21055
 
2.4%
1551
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII44808
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
36403
14.3%
43793
 
8.5%
53134
 
7.0%
21055
 
2.4%
1551
 
1.2%

TIPI8
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing65
Missing (%)0.4%
Memory size117.3 KiB
3.0
6664 
4.0
2476 
2.0
2066 
5.0
1908 
1.0
1821 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44805
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3.0
2nd row1.0
3rd row4.0
4th row4.0
5th row4.0

Common Values

Value      Count  Frequency (%)
3.0        6664   44.4%
4.0        2476   16.5%
2.0        2066   13.8%
5.0        1908   12.7%
1.0        1821   12.1%
(Missing)  65     0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.06664
44.6%
4.02476
 
16.6%
2.02066
 
13.8%
5.01908
 
12.8%
1.01821
 
12.2%

Most occurring characters

ValueCountFrequency (%)
.14935
33.3%
014935
33.3%
36664
14.9%
42476
 
5.5%
22066
 
4.6%
51908
 
4.3%
11821
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29870
66.7%
Other Punctuation14935
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014935
50.0%
36664
22.3%
42476
 
8.3%
22066
 
6.9%
51908
 
6.4%
11821
 
6.1%
Other Punctuation
ValueCountFrequency (%)
.14935
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44805
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14935
33.3%
014935
33.3%
36664
14.9%
42476
 
5.5%
22066
 
4.6%
51908
 
4.3%
11821
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII44805
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14935
33.3%
014935
33.3%
36664
14.9%
42476
 
5.5%
22066
 
4.6%
51908
 
4.3%
11821
 
4.1%

TIPI9
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing64
Missing (%)0.4%
Memory size117.3 KiB
3.0
6927 
4.0
2927 
2.0
1952 
5.0
1924 
1.0
1206 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44808
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5.0
2nd row2.0
3rd row5.0
4th row3.0
5th row3.0

Common Values

Value      Count  Frequency (%)
3.0        6927   46.2%
4.0        2927   19.5%
2.0        1952   13.0%
5.0        1924   12.8%
1.0        1206   8.0%
(Missing)  64     0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.06927
46.4%
4.02927
19.6%
2.01952
 
13.1%
5.01924
 
12.9%
1.01206
 
8.1%

Most occurring characters

ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
36927
15.5%
42927
 
6.5%
21952
 
4.4%
51924
 
4.3%
11206
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29872
66.7%
Other Punctuation14936
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014936
50.0%
36927
23.2%
42927
 
9.8%
21952
 
6.5%
51924
 
6.4%
11206
 
4.0%
Other Punctuation
ValueCountFrequency (%)
.14936
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44808
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
36927
15.5%
42927
 
6.5%
21952
 
4.4%
51924
 
4.3%
11206
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII44808
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14936
33.3%
014936
33.3%
36927
15.5%
42927
 
6.5%
21952
 
4.4%
51924
 
4.3%
11206
 
2.7%

TIPI10
Categorical

Distinct5
Distinct (%)< 0.1%
Missing80
Missing (%)0.5%
Memory size117.3 KiB
3.0
5077 
1.0
4993 
2.0
3774 
4.0
665 
5.0
 
411

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44760
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3.0
2nd row2.0
3rd row2.0
4th row3.0
5th row2.0

Common Values

Value      Count  Frequency (%)
3.0        5077   33.8%
1.0        4993   33.3%
2.0        3774   25.2%
4.0        665    4.4%
5.0        411    2.7%
(Missing)  80     0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.05077
34.0%
1.04993
33.5%
2.03774
25.3%
4.0665
 
4.5%
5.0411
 
2.8%

Most occurring characters

ValueCountFrequency (%)
.14920
33.3%
014920
33.3%
35077
 
11.3%
14993
 
11.2%
23774
 
8.4%
4665
 
1.5%
5411
 
0.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29840
66.7%
Other Punctuation14920
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014920
50.0%
35077
 
17.0%
14993
 
16.7%
23774
 
12.6%
4665
 
2.2%
5411
 
1.4%
Other Punctuation
ValueCountFrequency (%)
.14920
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44760
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14920
33.3%
014920
33.3%
35077
 
11.3%
14993
 
11.2%
23774
 
8.4%
4665
 
1.5%
5411
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII44760
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14920
33.3%
014920
33.3%
35077
 
11.3%
14993
 
11.2%
23774
 
8.4%
4665
 
1.5%
5411
 
0.9%

VCL1
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
14380 
0
 
620

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

Value  Count  Frequency (%)
1      14380  95.9%
0      620    4.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
114380
95.9%
0620
 
4.1%

Most occurring characters

ValueCountFrequency (%)
114380
95.9%
0620
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
114380
95.9%
0620
 
4.1%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
114380
95.9%
0620
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114380
95.9%
0620
 
4.1%

VCL2
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
13825 
0
 
1175

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

Value  Count  Frequency (%)
1      13825  92.2%
0      1175   7.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
113825
92.2%
01175
 
7.8%

Most occurring characters

ValueCountFrequency (%)
113825
92.2%
01175
 
7.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
113825
92.2%
01175
 
7.8%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
113825
92.2%
01175
 
7.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
113825
92.2%
01175
 
7.8%

VCL3
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
7937 
0
7063 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

Value  Count  Frequency (%)
1      7937   52.9%
0      7063   47.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
17937
52.9%
07063
47.1%

Most occurring characters

ValueCountFrequency (%)
17937
52.9%
07063
47.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
17937
52.9%
07063
47.1%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
17937
52.9%
07063
47.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17937
52.9%
07063
47.1%

VCL4
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
14526 
0
 
474

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

Value  Count  Frequency (%)
1      14526  96.8%
0      474    3.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
114526
96.8%
0474
 
3.2%

Most occurring characters

ValueCountFrequency (%)
114526
96.8%
0474
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
114526
96.8%
0474
 
3.2%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
114526
96.8%
0474
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114526
96.8%
0474
 
3.2%

VCL5
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
14210 
0
 
790

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

Value  Count  Frequency (%)
1      14210  94.7%
0      790    5.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
114210
94.7%
0790
 
5.3%

Most occurring characters

ValueCountFrequency (%)
114210
94.7%
0790
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
114210
94.7%
0790
 
5.3%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
114210
94.7%
0790
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114210
94.7%
0790
 
5.3%

VCL6
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
0
13477 
1
1523 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

Value  Count  Frequency (%)
0      13477  89.8%
1      1523   10.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
013477
89.8%
11523
 
10.2%

Most occurring characters

ValueCountFrequency (%)
013477
89.8%
11523
 
10.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
013477
89.8%
11523
 
10.2%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
013477
89.8%
11523
 
10.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
013477
89.8%
11523
 
10.2%

VCL7
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
0
11460 
1
3540 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row1
4th row0
5th row0

Common Values

Value  Count  Frequency (%)
0      11460  76.4%
1      3540   23.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
011460
76.4%
13540
 
23.6%

Most occurring characters

ValueCountFrequency (%)
011460
76.4%
13540
 
23.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
011460
76.4%
13540
 
23.6%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
011460
76.4%
13540
 
23.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
011460
76.4%
13540
 
23.6%

VCL8
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
0
8451 
1
6549 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row1

Common Values

Value  Count  Frequency (%)
0      8451   56.3%
1      6549   43.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
08451
56.3%
16549
43.7%

Most occurring characters

ValueCountFrequency (%)
08451
56.3%
16549
43.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
08451
56.3%
16549
43.7%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
08451
56.3%
16549
43.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
08451
56.3%
16549
43.7%

VCL9
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
0
13992 
1
 
1008

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

Value  Count  Frequency (%)
0      13992  93.3%
1      1008   6.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
013992
93.3%
11008
 
6.7%

Most occurring characters

ValueCountFrequency (%)
013992
93.3%
11008
 
6.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
013992
93.3%
11008
 
6.7%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
013992
93.3%
11008
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
013992
93.3%
11008
 
6.7%

VCL10
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
14341 
0
 
659

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

Value  Count  Frequency (%)
1      14341  95.6%
0      659    4.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
114341
95.6%
0659
 
4.4%

Most occurring characters

ValueCountFrequency (%)
114341
95.6%
0659
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
114341
95.6%
0659
 
4.4%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
114341
95.6%
0659
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114341
95.6%
0659
 
4.4%

VCL11
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
0
10799 
1
4201 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

Value  Count  Frequency (%)
0      10799  72.0%
1      4201   28.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
010799
72.0%
14201
 
28.0%

Most occurring characters

ValueCountFrequency (%)
010799
72.0%
14201
 
28.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
010799
72.0%
14201
 
28.0%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
010799
72.0%
14201
 
28.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
010799
72.0%
14201
 
28.0%

VCL12
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
0
11846 
1
3154 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

Value  Count  Frequency (%)
0      11846  79.0%
1      3154   21.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
011846
79.0%
13154
 
21.0%

Most occurring characters

ValueCountFrequency (%)
011846
79.0%
13154
 
21.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
011846
79.0%
13154
 
21.0%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
011846
79.0%
13154
 
21.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
011846
79.0%
13154
 
21.0%

VCL13
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
11417 
0
3583 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row1
4th row1
5th row0

Common Values

Value  Count  Frequency (%)
1      11417  76.1%
0      3583   23.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
111417
76.1%
03583
 
23.9%

Most occurring characters

ValueCountFrequency (%)
111417
76.1%
03583
 
23.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
111417
76.1%
03583
 
23.9%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
111417
76.1%
03583
 
23.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
111417
76.1%
03583
 
23.9%

VCL14
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
13593 
0
1407 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
113593
90.6%
01407
 
9.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
113593
90.6%
01407
 
9.4%

Most occurring characters

ValueCountFrequency (%)
113593
90.6%
01407
 
9.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
113593
90.6%
01407
 
9.4%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
113593
90.6%
01407
 
9.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
113593
90.6%
01407
 
9.4%

VCL15
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
14398 
0
 
602

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row0

Common Values

ValueCountFrequency (%)
114398
96.0%
0602
 
4.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
114398
96.0%
0602
 
4.0%

Most occurring characters

ValueCountFrequency (%)
114398
96.0%
0602
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
114398
96.0%
0602
 
4.0%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
114398
96.0%
0602
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114398
96.0%
0602
 
4.0%

VCL16
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
14709 
0
 
291

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
114709
98.1%
0291
 
1.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
114709
98.1%
0291
 
1.9%

Most occurring characters

ValueCountFrequency (%)
114709
98.1%
0291
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
114709
98.1%
0291
 
1.9%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
114709
98.1%
0291
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114709
98.1%
0291
 
1.9%

education
Categorical

HIGH CORRELATION
MISSING

Distinct4
Distinct (%)< 0.1%
Missing167
Missing (%)1.1%
Memory size117.3 KiB
2.0
6251 
3.0
3887 
1.0
2872 
4.0
1823 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44499
Distinct characters6
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row2.0
2nd row4.0
3rd row2.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
2.06251
41.7%
3.03887
25.9%
1.02872
19.1%
4.01823
 
12.2%
(Missing)167
 
1.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
2.06251
42.1%
3.03887
26.2%
1.02872
19.4%
4.01823
 
12.3%

Most occurring characters

ValueCountFrequency (%)
.14833
33.3%
014833
33.3%
26251
14.0%
33887
 
8.7%
12872
 
6.5%
41823
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29666
66.7%
Other Punctuation14833
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014833
50.0%
26251
21.1%
33887
 
13.1%
12872
 
9.7%
41823
 
6.1%
Other Punctuation
ValueCountFrequency (%)
.14833
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44499
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14833
33.3%
014833
33.3%
26251
14.0%
33887
 
8.7%
12872
 
6.5%
41823
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII44499
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14833
33.3%
014833
33.3%
26251
14.0%
33887
 
8.7%
12872
 
6.5%
41823
 
4.1%

urban
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
2
6888 
3
5326 
1
2704 
0
 
82

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters4
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row1
4th row3
5th row2

Common Values

ValueCountFrequency (%)
26888
45.9%
35326
35.5%
12704
 
18.0%
082
 
0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
26888
45.9%
35326
35.5%
12704
 
18.0%
082
 
0.5%

Most occurring characters

ValueCountFrequency (%)
26888
45.9%
35326
35.5%
12704
 
18.0%
082
 
0.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
26888
45.9%
35326
35.5%
12704
 
18.0%
082
 
0.5%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
26888
45.9%
35326
35.5%
12704
 
18.0%
082
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
26888
45.9%
35326
35.5%
12704
 
18.0%
082
 
0.5%

gender
Categorical

Distinct3
Distinct (%)< 0.1%
Missing19
Missing (%)0.1%
Memory size117.3 KiB
2.0
9074 
1.0
5178 
3.0
 
729

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44943
Distinct characters5
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row3.0
2nd row2.0
3rd row1.0
4th row1.0
5th row2.0

Common Values

ValueCountFrequency (%)
2.09074
60.5%
1.05178
34.5%
3.0729
 
4.9%
(Missing)19
 
0.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
2.09074
60.6%
1.05178
34.6%
3.0729
 
4.9%

Most occurring characters

ValueCountFrequency (%)
.14981
33.3%
014981
33.3%
29074
20.2%
15178
 
11.5%
3729
 
1.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29962
66.7%
Other Punctuation14981
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014981
50.0%
29074
30.3%
15178
 
17.3%
3729
 
2.4%
Other Punctuation
ValueCountFrequency (%)
.14981
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44943
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14981
33.3%
014981
33.3%
29074
20.2%
15178
 
11.5%
3729
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII44943
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14981
33.3%
014981
33.3%
29074
20.2%
15178
 
11.5%
3729
 
1.6%

engnat
Categorical

Distinct2
Distinct (%)< 0.1%
Missing47
Missing (%)0.3%
Memory size117.3 KiB
1.0
9779 
2.0
5174 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44859
Distinct characters4
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row1.0
2nd row1.0
3rd row2.0
4th row1.0
5th row2.0

Common Values

ValueCountFrequency (%)
1.09779
65.2%
2.05174
34.5%
(Missing)47
 
0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.09779
65.4%
2.05174
34.6%

Most occurring characters

ValueCountFrequency (%)
.14953
33.3%
014953
33.3%
19779
21.8%
25174
 
11.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29906
66.7%
Other Punctuation14953
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014953
50.0%
19779
32.7%
25174
 
17.3%
Other Punctuation
ValueCountFrequency (%)
.14953
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44859
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14953
33.3%
014953
33.3%
19779
21.8%
25174
 
11.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII44859
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14953
33.3%
014953
33.3%
19779
21.8%
25174
 
11.5%

age
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct76
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.74086667
Minimum13
Maximum38822
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size117.3 KiB

Quantile statistics

Minimum13
5-th percentile14
Q117
median20
Q327
95-th percentile49
Maximum38822
Range38809
Interquartile range (IQR)10

Descriptive statistics

Standard deviation317.0584356
Coefficient of variation (CV)11.85670007
Kurtosis14947.95196
Mean26.74086667
Median Absolute Deviation (MAD)4
Skewness122.1567763
Sum401113
Variance100526.0516
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
181386
 
9.2%
171313
 
8.8%
161239
 
8.3%
191101
 
7.3%
20932
 
6.2%
15899
 
6.0%
21833
 
5.6%
22663
 
4.4%
23605
 
4.0%
14533
 
3.6%
Other values (66)5496
36.6%
ValueCountFrequency (%)
13353
 
2.4%
14533
 
3.6%
15899
6.0%
161239
8.3%
171313
8.8%
181386
9.2%
191101
7.3%
20932
6.2%
21833
5.6%
22663
4.4%
ValueCountFrequency (%)
388221
< 0.1%
7221
< 0.1%
5451
< 0.1%
3361
< 0.1%
1232
< 0.1%
1001
< 0.1%
991
< 0.1%
881
< 0.1%
811
< 0.1%
801
< 0.1%
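
The derived figures reported for age follow from the basic ones by simple arithmetic. The sketch below re-derives the range, interquartile range, coefficient of variation and variance from the reported minimum, maximum, quartiles, mean and standard deviation. The variable names are illustrative only.

```python
# Values copied from the age summary in this report.
minimum, maximum = 13, 38822
q1, q3 = 17, 27
mean = 26.74086667
std = 317.0584356

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print(value_range, iqr, round(cv, 4), round(variance, 2))
```

The standard deviation is far larger than the IQR because a few extreme entries (such as 38822) dominate the moment-based statistics, while the quantile-based ones are barely affected.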

hand
Categorical

Distinct3
Distinct (%)< 0.1%
Missing47
Missing (%)0.3%
Memory size117.3 KiB
1.0
12794 
2.0
1550 
3.0
 
609

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44859
Distinct characters5
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row2.0
2nd row1.0
3rd row1.0
4th row2.0
5th row2.0

Common Values

ValueCountFrequency (%)
1.012794
85.3%
2.01550
 
10.3%
3.0609
 
4.1%
(Missing)47
 
0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.012794
85.6%
2.01550
 
10.4%
3.0609
 
4.1%

Most occurring characters

ValueCountFrequency (%)
.14953
33.3%
014953
33.3%
112794
28.5%
21550
 
3.5%
3609
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29906
66.7%
Other Punctuation14953
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014953
50.0%
112794
42.8%
21550
 
5.2%
3609
 
2.0%
Other Punctuation
ValueCountFrequency (%)
.14953
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44859
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14953
33.3%
014953
33.3%
112794
28.5%
21550
 
3.5%
3609
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII44859
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14953
33.3%
014953
33.3%
112794
28.5%
21550
 
3.5%
3609
 
1.4%

religion
Real number (ℝ≥0)

MISSING

Distinct12
Distinct (%)0.1%
Missing245
Missing (%)1.6%
Infinite0
Infinite (%)0.0%
Mean4.098339546
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size117.3 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q36
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.425699163
Coefficient of variation (CV)0.8358749012
Kurtosis0.02402705209
Mean4.098339546
Median Absolute Deviation (MAD)1
Skewness1.084271844
Sum60471
Variance11.73541476
MonotonicityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
24335
28.9%
13665
24.4%
41674
 
11.2%
61317
 
8.8%
121233
 
8.2%
71198
 
8.0%
10483
 
3.2%
8246
 
1.6%
3216
 
1.4%
9211
 
1.4%
Other values (2)177
 
1.2%
(Missing)245
 
1.6%
ValueCountFrequency (%)
13665
24.4%
24335
28.9%
3216
 
1.4%
41674
 
11.2%
5156
 
1.0%
61317
 
8.8%
71198
 
8.0%
8246
 
1.6%
9211
 
1.4%
10483
 
3.2%
ValueCountFrequency (%)
121233
8.2%
1121
 
0.1%
10483
 
3.2%
9211
 
1.4%
8246
 
1.6%
71198
8.0%
61317
8.8%
5156
 
1.0%
41674
11.2%
3216
 
1.4%

orientation
Categorical

MISSING

Distinct5
Distinct (%)< 0.1%
Missing399
Missing (%)2.7%
Memory size117.3 KiB
1.0
8331 
2.0
3157 
4.0
1254 
3.0
987 
5.0
872 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters43803
Distinct characters7
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row4.0
2nd row1.0
3rd row2.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.08331
55.5%
2.03157
 
21.0%
4.01254
 
8.4%
3.0987
 
6.6%
5.0872
 
5.8%
(Missing)399
 
2.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.08331
57.1%
2.03157
 
21.6%
4.01254
 
8.6%
3.0987
 
6.8%
5.0872
 
6.0%

Most occurring characters

ValueCountFrequency (%)
.14601
33.3%
014601
33.3%
18331
19.0%
23157
 
7.2%
41254
 
2.9%
3987
 
2.3%
5872
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29202
66.7%
Other Punctuation14601
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014601
50.0%
18331
28.5%
23157
 
10.8%
41254
 
4.3%
3987
 
3.4%
5872
 
3.0%
Other Punctuation
ValueCountFrequency (%)
.14601
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common43803
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14601
33.3%
014601
33.3%
18331
19.0%
23157
 
7.2%
41254
 
2.9%
3987
 
2.3%
5872
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII43803
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14601
33.3%
014601
33.3%
18331
19.0%
23157
 
7.2%
41254
 
2.9%
3987
 
2.3%
5872
 
2.0%

voted
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing85
Missing (%)0.6%
Memory size117.3 KiB
2.0
9443 
1.0
5472 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44745
Distinct characters4
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row2.0
2nd row1.0
3rd row2.0
4th row2.0
5th row2.0

Common Values

ValueCountFrequency (%)
2.09443
63.0%
1.05472
36.5%
(Missing)85
 
0.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
2.09443
63.3%
1.05472
36.7%

Most occurring characters

ValueCountFrequency (%)
.14915
33.3%
014915
33.3%
29443
21.1%
15472
 
12.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29830
66.7%
Other Punctuation14915
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014915
50.0%
29443
31.7%
15472
 
18.3%
Other Punctuation
ValueCountFrequency (%)
.14915
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44745
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14915
33.3%
014915
33.3%
29443
21.1%
15472
 
12.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII44745
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14915
33.3%
014915
33.3%
29443
21.1%
15472
 
12.2%

married
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing82
Missing (%)0.5%
Memory size117.3 KiB
1.0
12753 
2.0
1646 
3.0
 
519

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44754
Distinct characters5
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row1.0
2nd row2.0
3rd row3.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.012753
85.0%
2.01646
 
11.0%
3.0519
 
3.5%
(Missing)82
 
0.5%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.012753
85.5%
2.01646
 
11.0%
3.0519
 
3.5%

Most occurring characters

ValueCountFrequency (%)
.14918
33.3%
014918
33.3%
112753
28.5%
21646
 
3.7%
3519
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29836
66.7%
Other Punctuation14918
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014918
50.0%
112753
42.7%
21646
 
5.5%
3519
 
1.7%
Other Punctuation
ValueCountFrequency (%)
.14918
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44754
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14918
33.3%
014918
33.3%
112753
28.5%
21646
 
3.7%
3519
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII44754
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14918
33.3%
014918
33.3%
112753
28.5%
21646
 
3.7%
3519
 
1.2%

familysize
Real number (ℝ≥0)

MISSING
SKEWED

Distinct20
Distinct (%)0.1%
Missing319
Missing (%)2.1%
Infinite0
Infinite (%)0.0%
Mean2.744091002
Minimum1
Maximum2919
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size117.3 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q33
95-th percentile5
Maximum2919
Range2918
Interquartile range (IQR)1

Descriptive statistics

Standard deviation24.10934383
Coefficient of variation (CV)8.785912643
Kurtosis14585.55984
Mean2.744091002
Median Absolute Deviation (MAD)1
Skewness120.5748218
Sum40286
Variance581.2604598
MonotonicityNot monotonic
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
26303
42.0%
33573
23.8%
12373
 
15.8%
41445
 
9.6%
5535
 
3.6%
6256
 
1.7%
797
 
0.6%
853
 
0.4%
914
 
0.1%
1010
 
0.1%
Other values (10)22
 
0.1%
(Missing)319
 
2.1%
ValueCountFrequency (%)
12373
 
15.8%
26303
42.0%
33573
23.8%
41445
 
9.6%
5535
 
3.6%
6256
 
1.7%
797
 
0.6%
853
 
0.4%
914
 
0.1%
1010
 
0.1%
ValueCountFrequency (%)
29191
 
< 0.1%
392
 
< 0.1%
232
 
< 0.1%
191
 
< 0.1%
171
 
< 0.1%
161
 
< 0.1%
142
 
< 0.1%
135
< 0.1%
123
< 0.1%
114
< 0.1%

ASD
Categorical

Distinct2
Distinct (%)< 0.1%
Missing89
Missing (%)0.6%
Memory size117.3 KiB
2.0
13997 
1.0
 
914

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters44733
Distinct characters4
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row2.0
2nd row2.0
3rd row2.0
4th row2.0
5th row2.0

Common Values

ValueCountFrequency (%)
2.013997
93.3%
1.0914
 
6.1%
(Missing)89
 
0.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
2.013997
93.9%
1.0914
 
6.1%

Most occurring characters

ValueCountFrequency (%)
.14911
33.3%
014911
33.3%
213997
31.3%
1914
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number29822
66.7%
Other Punctuation14911
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
014911
50.0%
213997
46.9%
1914
 
3.1%
Other Punctuation
ValueCountFrequency (%)
.14911
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common44733
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.14911
33.3%
014911
33.3%
213997
31.3%
1914
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII44733
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.14911
33.3%
014911
33.3%
213997
31.3%
1914
 
2.0%

nerdiness
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.3 KiB
1
8303 
0
6697 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters15000
Distinct characters2
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row0

Common Values

ValueCountFrequency (%)
18303
55.4%
06697
44.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
18303
55.4%
06697
44.6%

Most occurring characters

ValueCountFrequency (%)
18303
55.4%
06697
44.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number15000
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
18303
55.4%
06697
44.6%

Most occurring scripts

ValueCountFrequency (%)
Common15000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
18303
55.4%
06697
44.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII15000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18303
55.4%
06697
44.6%

Interactions

Correlations

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
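
A minimal sketch of that calculation in plain Python follows. It ranks each variable (tied values share the mean of their positions, which is an assumption about tie handling) and then applies the covariance-over-standard-deviations formula to the ranks. The function names and the toy data are illustrative, not taken from the report's implementation.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values get the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Pearson correlation of the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))

# Toy data: y is a nonlinear but strictly increasing function of x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman(x, y)
r = pearson(x, y)
print(rho, r)
```

On this toy data ρ reaches its maximum because the relationship is monotonic, while r stays below it because the relationship is not linear.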

Pearson's r

Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship the steepness of the line does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
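
The following sketch implements that formula directly and checks the invariance property on made-up data: shifting and positively rescaling either variable leaves r unchanged. The function name and the sample values are assumptions for illustration.

```python
from math import sqrt

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Hypothetical paired observations.
x = [1.0, 2.0, 4.0, 7.0]
y = [2.0, 3.0, 3.5, 9.0]

r = pearson(x, y)
# Change location and scale of each variable separately.
r_rescaled = pearson([10 * a + 3 for a in x], [0.5 * b - 1 for b in y])
print(r, r_rescaled)
```

The common 1/n factors cancel between numerator and denominator, so the sketch omits them.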

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
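
The pair-counting definition can be sketched as below. This is the simple variant often called tau-a, in which tied pairs count as neither concordant nor discordant. Statistical libraries commonly apply a tie correction instead, so their results may differ on tied data. The function name and toy data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Toy data with exactly one discordant pair: (2, 3) versus (3, 2).
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```

With four observations there are six pairs, five of which are concordant here, giving τ = (5 - 1) / 6.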

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
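
A sketch of the bias-corrected estimator in plain Python is given below. It follows the commonly cited form of Bergsma's correction: the chi-squared-based φ² is reduced by (k-1)(r-1)/(n-1) and the row and column counts are shrunk correspondingly. The function name and the toy data are assumptions, and the report's own implementation may differ in detail.

```python
from collections import Counter
from math import sqrt

def cramers_v_corrected(x, y):
    """Bias-corrected Cramér's V for two equally long sequences of labels."""
    n = len(x)
    joint = Counter(zip(x, y))
    rows, cols = Counter(x), Counter(y)
    r, k = len(rows), len(cols)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for a, row_total in rows.items():
        for b, col_total in cols.items():
            expected = row_total * col_total / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 0.0  # a constant variable carries no association
    return sqrt(phi2_corr / denom)

# Toy data: perfect association and exact independence.
labels = ["a", "b"] * 50
v_perfect = cramers_v_corrected(labels, labels)
v_independent = cramers_v_corrected(["a", "a", "b", "b"] * 25,
                                    ["c", "d", "c", "d"] * 25)
print(v_perfect, v_independent)
```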

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
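
As a rough sketch of what a nullity correlation measures, the code below turns each column into a present/absent indicator and correlates the indicators. The columns are hypothetical, `None` stands in for a missing cell, and the function name is made up for illustration. The plotting library behind the report may compute this differently.

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation of the 'value is present' indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    sa = sqrt(sum((p - ma) ** 2 for p in a))
    sb = sqrt(sum((q - mb) ** 2 for q in b))
    if sa == 0 or sb == 0:
        return float("nan")  # a fully present or fully missing column has no variance
    return cov / (sa * sb)

# Hypothetical columns: `second` is filled in exactly where `first` is missing,
# while `third` is missing in the same rows as `first`.
first = [1, None, 3, None, 5, 6]
second = [None, 2, None, 4, None, None]
third = ["x", None, "y", None, "z", "w"]

opposite = nullity_correlation(first, second)
together = nullity_correlation(first, third)
print(opposite, together)
```

A value near -1 means one variable tends to be present when the other is missing, and a value near +1 means the two tend to be missing together.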

Sample

First rows

df_indexQ1Q2Q3Q4Q5Q6Q7Q8Q9Q10Q11Q12Q13Q14Q15Q16Q17Q18Q19Q20Q21Q22Q23Q24Q25Q26countryintroelapsetestelapsesurveyelapseTIPI1TIPI2TIPI3TIPI4TIPI5TIPI6TIPI7TIPI8TIPI9TIPI10VCL1VCL2VCL3VCL4VCL5VCL6VCL7VCL8VCL9VCL10VCL11VCL12VCL13VCL14VCL15VCL16educationurbangenderengnatagehandreligionorientationvotedmarriedfamilysizeASDnerdiness
001.05.05.05.01.04.05.05.01.03.05.05.05.05.05.05.05.01.05.05.01.05.01.05.01.01.0USA355364.03.05.01.03.05.05.03.05.03.011011000010001112.013.01.0202.012.04.02.01.04.02.01
114.04.04.04.04.05.04.04.03.03.01.04.05.03.01.02.04.05.01.03.01.01.05.03.02.05.0USA5851204.02.03.05.03.02.05.01.02.02.011111010010011114.022.01.0491.02.01.01.02.04.02.01
224.05.05.04.03.05.05.05.04.04.02.05.05.05.01.03.05.03.05.02.02.01.02.04.02.05.0NLD91081001.02.03.01.05.05.03.04.05.02.011011011010011112.011.02.0431.02.02.02.03.04.02.01
334.04.04.02.04.03.03.05.03.04.05.02.02.04.04.02.04.05.04.03.03.04.03.04.04.02.0USA21211393.03.03.04.05.03.04.04.03.03.011011000010011111.031.01.0172.01.01.02.01.02.02.01
444.04.04.04.03.03.04.02.03.04.04.04.03.05.05.02.04.01.04.02.04.02.03.04.04.04.0ITA36402163.03.04.04.04.04.03.04.03.02.011011001010001011.022.02.0182.012.01.02.01.01.02.00
555.04.05.05.05.05.05.04.04.03.04.05.02.03.04.01.01.02.04.02.02.02.03.05.03.05.0USA31001765.03.03.03.05.02.04.05.03.01.011011001010011113.021.01.0261.01.01.01.01.01.01.01
664.03.04.03.05.04.05.04.05.05.03.05.05.03.03.03.05.05.03.04.04.03.05.05.03.05.0USA17881643.02.05.03.04.03.03.02.03.02.011011000010011114.032.02.0401.01.01.02.01.01.02.01
774.05.04.04.04.04.02.05.03.04.02.02.03.03.05.02.04.05.02.03.05.03.05.05.01.04.0NLD20531123.02.05.05.03.03.03.02.02.02.011110001011011113.012.02.0341.02.05.01.01.02.02.01
884.04.03.04.04.05.04.03.03.04.04.04.03.03.03.01.02.04.01.03.01.02.05.03.01.04.0USA91642133.01.04.03.04.03.04.01.03.03.011111100010011112.022.01.0201.07.01.01.01.03.02.00
993.03.04.03.04.02.04.02.04.04.05.04.01.04.02.05.04.04.05.03.01.01.02.04.04.02.0ARE1091341772.02.05.03.03.04.04.02.04.04.011011000010011112.021.02.0171.010.01.02.01.05.02.00

Last rows

df_indexQ1Q2Q3Q4Q5Q6Q7Q8Q9Q10Q11Q12Q13Q14Q15Q16Q17Q18Q19Q20Q21Q22Q23Q24Q25Q26countryintroelapsetestelapsesurveyelapseTIPI1TIPI2TIPI3TIPI4TIPI5TIPI6TIPI7TIPI8TIPI9TIPI10VCL1VCL2VCL3VCL4VCL5VCL6VCL7VCL8VCL9VCL10VCL11VCL12VCL13VCL14VCL15VCL16educationurbangenderengnatagehandreligionorientationvotedmarriedfamilysizeASDnerdiness
14990149905.05.05.03.05.01.05.05.05.03.03.01.01.05.01.03.01.03.05.01.05.01.03.05.01.05.0PAK51492163.03.04.05.05.05.05.05.05.01.010010000010000112.012.02.0151.010.0NaN2.01.05.02.00
14991149914.05.04.04.05.05.05.03.0NaN5.01.05.05.04.01.03.04.03.01.05.03.03.05.05.05.05.0USA4083821982.03.0NaN3.05.03.03.02.03.01.01111101001101111NaN12.01.0801.03.01.01.03.03.02.01
14992149925.05.05.05.05.04.05.05.05.05.05.05.05.05.04.05.05.05.05.05.05.03.05.04.05.05.0CAN3771892.03.03.02.04.05.03.04.04.01.011111010010111113.012.02.0241.012.02.01.01.03.01.01
14993149935.02.01.03.04.03.01.04.04.03.01.01.03.02.01.02.02.02.01.03.04.01.02.04.01.04.0USA5871474.04.03.03.04.02.03.03.03.01.011011000010000111.022.01.0131.01.01.02.01.03.02.00
14994149945.05.02.05.05.05.05.05.05.05.01.05.05.05.04.05.05.05.02.05.05.03.05.05.03.05.0AUS466808103.03.03.03.04.03.05.05.03.02.011111010011011113.022.01.0361.012.05.02.01.05.02.01
14995149952.05.04.03.03.04.04.04.03.04.01.04.04.03.04.04.02.05.02.04.01.02.05.04.02.04.0USA121031612.02.04.03.03.05.03.03.03.03.011111000010111112.022.01.0171.01.03.02.01.03.02.00
14996149965.04.05.04.04.05.05.04.04.05.01.04.04.04.02.05.05.04.01.05.03.04.04.05.04.05.0USA311061793.02.04.05.04.03.04.01.02.02.011011000010001114.012.02.0451.03.01.01.02.03.02.01
14997149974.05.05.05.05.05.05.05.04.05.04.05.05.05.05.05.05.05.04.05.05.02.05.05.03.04.0USA171031681.03.02.05.01.05.03.03.01.01.011011000010011112.022.01.0201.01.02.01.01.03.01.01
14998149985.05.04.05.05.05.05.01.05.05.03.05.04.04.01.05.04.05.05.02.05.03.05.03.03.05.0USA14681091.01.03.05.04.05.05.04.02.01.011111001010111113.022.01.0291.012.04.02.02.02.01.00
14999149995.04.02.05.02.02.04.02.04.04.04.04.03.03.04.03.03.03.05.03.04.02.03.04.03.04.0BRA81823484.03.03.03.04.02.04.04.03.02.011111000111111112.031.02.0211.02.02.02.01.01.02.01